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Turkey censors evolution 


Turkey's government has done more for science than many. A row over a censored 
magazine and a sacked editor could put the good work at risk. 


t has been the biggest crisis in Turkish academia since last year’s 
lifting of the headscarf ban in universities. Last week a portrait of 
Charles Darwin was taken off the cover of the March issue of the 
government-backed science magazine Bilim ve Teknik (Science and 
Technology) just before it went to press. TUBITAK, Turkey’s national 
science funding agency, which publishes the magazine, then sacked 
its editor, Cigdem Atakuman. Scientists, assuming censorship, are 
justifiably outraged and protests are ongoing. 

Science minister Mehmet Aydin, a historian of philosophy and reli- 
gion, expressed discomfort at the cover’s removal — but also fanned 
the flames by commenting: “What kind of a fight can we have with 
Darwin? The guy is already dead.” He made matters worse by later 
adding: “[TUBITAK] is supposed to reflect the views of all those who 
have served science, no matter how mistaken they can be.” 

TUBITAK vice-president Omer Cebeci, who sits on the magazine’s 
editorial board, pulled the plug on Darwin. He denied censorship, 
charging that Atakuman had secretly changed an issue intended to 
cover global warming. Not true, says Atakuman, who says Cebeci told 
her that the Darwin cover was a “provocation” at a time of imminent 
local elections. One editorial-board member of Bilim ve Teknik has 
resigned in protest at what he, at least, considers censorship. 

This row has brought into focus two issues that plague Turkish 
science. One is political interference in the scientific civil service; the 
other is high levels of public support for creationism. 

In Turkey, as in many countries, the civil service is expected to 
mirror the ruling party’s ideology. So, although they are keen funders 
of research, most senior government officials, in common with most 
of the population, do not believe in evolution by natural selection. 
The education minister Hiiseyin Celik, for example, has proclaimed 
his belief in intelligent design. Yet Turkey is one of three current 


candidates for membership of the European Union (EU). Ankara 
and Istanbul house the largest overseas missions of the European 
Commission, whose officials are monitoring all aspects of Turkish 
public life and constantly advising 
on what needs to be done to harmo- 
nize laws and practices with those 
in the EU. 

Science and technology was one 
of the first ‘chapters’ that the EU said 
it was satisfied with and that, pro- 
visionally, did not require further 
reform. Officials will now almost 
certainly be alarmed to see the extent of political interference in 
TUBITAK. 

Only last week, the European Parliament issued a report stating 
that Turkey needed to make much faster progress in areas such as 
censorship. European officials will see this latest episode as evidence 
that the country has some way to go on that score. 

TUBITAK needs to initiate a transparent investigation into the 
Bilim ve Teknik affair. The organization should also consider making 
an unambiguous statement of its position on evolution, intelligent 
design and creationism to reconfirm its credentials as a serious sci- 
entific body. In the past, TUBITAK has provided reliable informa- 
tion on Darwin’s theory in a country where creation is offered as an 
alternative to evolution in high-school biology teaching. The agency 
could do that again. After all, none of the world’s religions commands 
its believers to be creationists. Many Islamic scholars and thinkers 
have speculated on the origins of life. 

Turkey’s ruling party must learn from this latest affair. It must keep 
religion out of science policy, and be seen to be doing so. a 


“Although they are keen 
funders of research, 
most senior officials, in 
common with most of 
the population, do not 
believe in evolution.” 


Health highway 


President Obama's funds for electronic health 
records should prompt research — and controversy. 


US stimulus bill is that 53 of its 407 pages are devoted to set- 

ting up the administrative framework for a national system 

of electronic health records (EHRs). At the head of this framework, 

which encompasses committees on standards, policy and much else, 

will be a National Coordinator for Health Information Technology, 
located within the Department of Health and Human Services. 

Many will find it even more remarkable that this portion of the 

stimulus bill builds upon foundations laid by former president George 

W. Bush, who established the office of the national coordinator by 


C) ne of the most striking, if little noted, aspects of the recent 


executive order in 2004, and set him the goal of giving every American 
access to EHRs by 2014. Now, with the stimulus bill, Congress and the 
Obama administration have not only embraced that goal, but have 
allocated $19 billion to help reach it. 

Better still, from the research perspective, the stimulus bill explic- 
itly envisions scientists being able to use these clinical records for 
a wide variety of population studies — monitoring the spread of 
flu, tuberculosis and other diseases, for example, or spotting people 
who might be good candidates for clinical trials, or looking for signs 
of adverse reactions after a drug has been released on the market 
(see page 278). 

Many researchers believe that such studies could ultimately pro- 
duce benefits as least as big as the effects of electronic records on 
clinical care — but only if some major issues about data access get 
resolved first. 
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Imagine, for example, that Mrs Smith checks into the hospital for 
hip-replacement surgery. Her doctors will be able to call up the elec- 
tronic records of her personal physician, her physical therapist, her 
pharmacist and everyone else who provides her with health care. But 
this scenario poses some knotty questions. Even leaving aside the 
technical challenge of securing all those far-flung databases against 
attacks by hackers, who has the right to access any given database? 
And how much are they allowed to see there — everything, or just 
those portions of the data they need for their jobs, keeping the rest 
private? And who decides? Patients? Doctors? Hospitals? Anonymi- 
zation and pseudonymization, in which identities are masked but all 
data can still be ascribed to respective individuals, protect individu- 
als while enabling research, but how should such measures best be 
implemented? 

These questions have been contentious even in the United King- 
dom, which has deployed an electronic records system through its 
unified National Health Service. They will be even more contentious 
in the United States, where the health system is anything but unified, 
and where privacy concerns run deep. 

The scientific questions are just as knotty. For example, will each 
patient have to give permission each time a researcher wants to access 
his or her records? That sounds reasonable — and indeed, this is the 


spirit of the strict data privacy and security provisions of the federal 
Health Insurance Portability and Accountability Act. But that would 
make using the records for, say, routine epidemiology prohibitively 
cumbersome. There needs to be a clear policy allowing for such 
uses, as long as the researchers have no access to any aspect of the 
data that reveal a person’s identity. But at the same time, the health- 
records system should provide individuals with an audit trail that 
shows them who has been looking at their records, and what they 
have seen there. 

Or consider another scenario: a research group that wants to doa 
study aggregating EHR data over a large area. Will they have to get 
approval from every Institutional Review Board (IRB) — which over- 
see all research on human subjects in the United States — in every 
hospital and university involved? Common sense suggests that one 
IRB process is enough — and then only if the research uses person- 
ally identifiable data. But mechanisms to coordinate the oversight 
will have to be created. 

The national coordinator and his advisory committees will have 
their hands full with these and many other issues. But the good news 
is that a management framework is in now place to address them, and 
to build an EHR system that will have the confidence of professionals 
and public alike. | 


Filling the void 


As science journalism declines, scientists 
must rise up and reach out. 


Geneva, Switzerland, opened the wine last week to celebrate 

the twentieth anniversary of the laboratory's invention of the 
World Wide Web. The scientists were also joined by around 60 mem- 
bers of the media, who may have been in a less festive mood. Even 
before the current economic crisis, the web was inflicting much pain 
on the mass media. Circulations have dropped, advertising has dried 
up and newspapers have been forced to lay off reporters and scale 
back coverage. A similar slump has hit the broadcast market, with 
no end in sight. 

Science journalism is one of the numerous casualties in this media 
meltdown. Many science journalists are losing their jobs, and those 
who remain are being asked to provide content for blogs, podcasts, 
online videos and other new media (see page 274). Although it is 
difficult to know what effect these cutbacks have had on the public’s 
understanding of science, the general feeling is that the quality of 
science coverage in the conventional media is declining — as is the 
media's ability to play a watchdog role in science, ferreting out fraud 
or other misconduct. 

True, there is no shortage of scientific information on the web. 
Witness the way that research funding agencies use the web to inform 
the public about everything from planetary missions to public health. 
In principle, anyone with an Internet connection now has access to 
more, and better, scientific coverage than ever before. 

In practice, however, this sort of information reaches only those 


S cientists at CERN, Europe’s particle-physics laboratory near 
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who seek it out. An average citizen is unlikely to search the web for 
the Higgs boson or the proteasome if he or she doesn't hear about it 
first on, say, a cable news channel. And as mass media sheds its sci- 
entific expertise, science’s mass-market presence will become harder 
to maintain. 

Harder, but not impossible. For example, scientists are blogging 
in ever increasing numbers, and the most popular blogs draw hun- 
dreds of thousands of readers each month. These blogging scientists 
not only offer expertise for free, but have emerged as an important 
resource for reporters. A Nature survey of nearly 500 science journal- 
ists shows that most have used a scientist’s blog in developing story 
ideas. And a handful of universities, meanwhile, have started environ- 
mental publications that are run jointly by scientists and journalists. 
These publications seek to provide their journalistically valid, scien- 
tifically accurate content free of charge to the mainstream press. 

Sadly, these activities live on the fringe of the scientific enterprise. 
Blogging will not help, and could even hurt, a young researcher's 
chances of tenure. Many of their elders still look down on colleagues 
who blog, believing that research should be communicated only 
through conventional channels such as peer-review and publication. 
Indeed, many researchers are hesitant even to speak to the popular 
press, for fear of having their carefully chosen words twisted beyond 
recognition. 

But in today’s overstressed media market, scientists must change 
these attitudes if they want to stay in the public eye. They must rec- 
ognize the contributions of bloggers and others, and they should 
encourage any and all experiments that could help science better 
penetrate the news cycle. Even if they are reluctant to talk to the press 
themselves, they should encourage colleagues who do so responsibly. 
Scientists are poised to reach more people than ever, but only if they 
can embrace the very technology that they have developed. 7 
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RESEARCH HIGHLIGHTS 


Fiery fountains 


Geology 37, 219-222 (2009) 

On Earth, volcanic ‘fire fountains’ burst from the ground, 
driven by the fizz of carbonated magma rushing up 
narrow-necked vents. The same spectacular displays 
occurred on the Moon 3 billion-4 billion years ago, despite 
big differences in gravity, magma composition and magma 
viscosity, say Malcolm Rutherford of Brown University in 
Providence, Rhode Island, and Paolo Papale of the National 
Institute of Geophysics and Volcanology in Pisa, Italy. 

By studying data collected from lunar samples, the 
researchers found that gases — primarily carbon 
monoxide rather than the carbon dioxide of terrestrial 
magma — form by graphite oxidation in lunar magma at 
a depth of about 8 kilometres. Modelling the upwelling 
for lunar conditions, they found that the magma escapes 
at the surface at about the same speeds as on Earth, and 


with similar gas volumes. 


EVOLUTION 
Painful pairing 


Curr. Biol. 19, 404-407 (2009) 

Evolution of male sexual traits that harm 
females can be due to some reproductive 
benefit afforded the male by the harm, or to 
a negative side effect of a gene with some 
other positive benefit. 

Cosima Hotzy and Géran Arnqvist of 
Uppsala University in Sweden examined 
the relationship between the vicious 
spines on the gentialia of male seed beetles 
(Callosobruchus maculatus), the damage 
these cause to females during mating, and 
sperm competition success. 

They found a positive correlation between 
spine size and both harm to females and male 
fertilization success. However, statistical 
modelling showed that spine size still 
correlated with sperm success when harm 
was kept constant. This suggests that harm to 
females is simply an unfortunate side effect of 
the spines. 


CHEMISTRY 


Buckyball expansion 


Angew. Chem. Int. Edn doi:10.1002/anie.200805870 
(2009) 

‘Buckyball’ molecular structures aren't 
confined to carbon. Uranium fullerenes that 
contain various numbers of uranium atoms 
have now joined the club, thanks to Peter 
Burns and his colleagues at the University of 
Notre Dame in Indiana. 

Fullerene U60 (pictured right, top) looks 
just like its carbon analogue, whereas U44 
(bottom) is peanut-shaped, comprising 12 
hexagons and 12 pentagons. Fullerenes usually 
have as few adjacent pentagons as possible, 
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but the peanut bucks this trend. This is because 
the peanut shape has higher symmetry, which 
the uranyl clusters prefer over minimizing 
pentagon neighbours, the authors say. 

Uranium fullerenes are stable in solution 
for several months and easy to make by a 
self-assembly process in water — in contrast 
to carbon fullerenes, which have to be zapped 
into being with a huge current. 


CANCER THERAPEUTICS 


Socking it to melanoma 


J. Clin. Invest. doi:10.1172/JC134015 (2009) 
Human melanoma is resistant to treatment 
with retinoic acid, which is effective against 
several other cancers. But by activating the 
transcription factor SOX9, researchers at 
the National Cancer Institute in Bethesda, 
Maryland, may have discovered a way to 
make retinoic acid effective after all. 

In a series of cell culture and tissue 
experiments, Vincent Hearing and his 
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CORBIS 


colleagues identified mechanisms by which 
promoting SOX9 activity slows cell division 
and sensitizes melanoma cells to the drug. In 
mice injected with melanoma cells, a drug 
that activates SOX9 had mild effects on its 
own, but when combined with retinoic acid 
significantly reduced the size of tumours. 


GENOMICS 
Staying in shape 


Science 10.1126/science.1169050 (2009) 
Researchers have wondered why some 
regions of DNA that do not code for 

proteins but nevertheless are functionally 
important have sequences that have not been 
maintained by evolution. It could be because 
evolution is instead working to conserve the 
DNAs structure, say Elliott Margulies at the 
National Human Genome Research Institute 
in Bethesda, Maryland, Thomas Tullius at 
Boston University in Massachusetts and their 
colleagues. 

A DNA helix can have subtle structural 
variations in its grooves and its physical 
arrangement that affect how well proteins 
bind to it. By comparing a portion of DNA 
structure across 36 different species, the team 
estimates that 12% of the human genome 
is conserved — twice the estimate based on 
sequence conservation. The method should 
help to identify new non-coding regions with 
important biological roles. 


SEXUAL SELECTION 


Cheap flights 


Proc. R. Soc. B doi:10.1098/rspb.2009.0090 (2009) 
A flashy tail is often a way for a male bird to 
attract a mate, but the adornment has been 
assumed to make flying more arduous. 


To find out whether this is true, 
Christopher Clark and Robert Dudley at the 
University of California, Berkeley, trained 
Anna’s hummingbirds (Calypte anna) to fly 
in a wind tunnel with and without hugely 
elongated feathers from another species 
tacked onto their tail feathers. Elongated tails 
slowed the birds’ average top speed from 15.1 
to 14.6 metres per second. The increased 
metabolic cost of flight at high speeds was 
about 10%, but negligible at lower speeds. 

The researchers suggest that by hiding tails 
aerodynamically in the wake of the body, 
diverse sexual signals can be kept relatively 
cost-free. 


GENOMICS 


Big is beautiful 


PLoS ONE 4, e4688 (2009) 

Finding gene regulators in the fruitfly 
Drosophila melanogaster has proved difficult: 
looking for conserved non-coding DNA 
sequences, a method that works well in 
vertebrates, has been unsuccessful. Michael 
Eisen at the University of California, 
Berkeley, and his co-workers suggest that this 
is because the fly's genome is compact — not 
because its regulatory architecture differs 
substantially from that of vertebrates. 

Eisen’s group sequenced various loci from 
four species of tephritid — ‘true fruitflies — 
which have large genomes containing islands 
of conserved non-coding DNA sandwiched 
between unconserved stretches. This allowed 
the researchers to pinpoint six conserved 
tephritid non-coding regions that functioned 
as gene enhancers in D. melanogaster 
embryos. Until now, the compact size of the 
D. melanogaster genome has been considered 
aboon to scientists. 


MOLECULAR IMAGING 


Nailing the molecule 


Science 323, 1464-1468 (2009) 

Molecules in gases and liquids are always 
moving thanks to their thermal energy. This 
means that measuring many details of their 
dynamics, which would require accessing 
the interaction of the molecule with nearby 
molecules and light, is impossible. However, 
by using a short laser pulse, a molecule 

can be ‘frozer’ for a few picoseconds (107 
seconds), a time sufficient to examine, using 
ultrafast spectroscopy, the characteristics of 
its molecular dynamics. 

Albert Stolow of the Steacie Institute for 
Molecular Sciences in Ottawa, Canada, and 
his colleagues demonstrate their method with 
the carbon disulphide molecule, observing its 
dynamics in a photochemical reaction. 


EVOLUTION 


Fishy fangs 
Proc. R. Soc. B doi:10.1098/rspb.2009.0141 (2009) 


Carp-like cypriniform fishes lost their teeth 
millions of years ago, but a newly described 
species has come close to redeveloping them. 

Males of the tiny Danionella dracula have 
a series of fang-like pieces of bone projecting 
from their jaws, report Ralf Britz of London's 
Natural History Museum and his colleagues. 
So far, the maximum size reported for the 
fish, from Myanmar, is 16.7 millimetres. Its 
development also appears stunted; it retains 
many larval body features into adulthood and 
lacks 44 bones or parts of bones seen in its 
close relative the zebrafish (Danio rerio). 

Remaining mysteries include the function 
of the toothy projections and whether they 
are formed through the genetic pathway for 
true tooth formation, which is known to be 
conserved in zebrafish. 


MATERIALS 
Five-star ice 


Nature Mater. doi:10.1038/nmat2403 (2009) 

As asnowflake’s sixfold symmetry shows, ice 
crystals contain water molecules locked into 
hexagonal arrays. 

But Angelos Michaelides at the London 
Centre for Nanotechnology and his 
colleagues have identified circumstances in 
which ice breaks this rule. Water molecules 
forming nanometre-wide chains on a 
particular copper surface at 100-140 kelvin 
are arranged in pentagons. The motif 
arises from a balance of hydrogen-bonding 
interactions between water molecules and the 
strain induced by fitting to the lattice of the 
copper surface beneath. 


Correction 

The Research Highlight ‘Pogo-stick pictures’ 
(Nature 458, 11; 2009) described the subject of 
the accompanying image as a mouse cochlear 
hair cell. It is in fact a hippocampal neuron. 
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RESEARCH HIGHLIGHTS 


JOURNAL CLUB 


Ahmad M. Khalil 
Harvard Medical School 


A geneticist views two theories 
of X-chromosome inactivation in 
a broad context. 


Female mammals have two 

X chromosomes, one of which is 
inactivated to ensure that females 
get the same dose of X-linked 
genes as males. Two long, non- 
coding RNA molecules mediate 
this process. One, Xist, initiates 
silencing of the chromosome to be 
inactivated, whereas its antisense 
partner, Tsix, blocks such silencing 
of the remaining X. The exact 
mechanism by which Xist and Tsix 
exert their functions is not known. 

RNA interference (RNAi) 
describes a process by which RNA 
fragments ‘interfere’ with the 
creation of proteins from their RNA 
recipes. The recent discovery of 
RNAi in mammalian cells made it 
tempting to postulate that sense and 
antisense non-coding RNA partners, 
such as Xist and Tsix, are processed 
by the RNAi-associated enzyme 
DICER into small RNAs. Jeannie Lee 
and her team at Harvard Medical 
School have presented several 
lines of evidence in support of this, 
including data that demonstrate 
improper X inactivation in DICER's 
absence (Y. Ogawaet al. Science 
320, 1336-1341; 2008). 

By contrast, a study by David 
Livingston of Harvard's Dana- 
Farber Cancer Institute and his 
colleagues has shown that, in the 
absence of DICER, the inactive 
X chromosome in stem cells 
remains coated with Xist and 
several other repressive markers 
(C. Kanellopoulou et al. Proc. Nat! 
Acad. Sci. USA 106, 1122-1127; 
2009). This group suggests that 
the effects observed by Ogawaet al. 
may have been indirect because 
DICER is also involved in the 
processing of a class of small, non- 
coding RNAs known as microRNAs 
that can alter gene expression by 
fine-tuning protein production. 

Despite their differences, these 
studies should provide an incentive 
to further investigate the potential 
role of the RNAi pathway in the 
nucleus of mammalian cells. This 
will shed light on both X inactivation 
and gene regulation in general. 


Discuss this paper at http://blogs. 
nature.com/nature/journalclub 
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NEWS 


Europe's green billions 


Clean-energy investments target eastern Europe, but it remains to be 
seen whether they will create as many new jobs as hoped. 


Three months after European Union (EU) 
leaders committed themselves to cutting 
greenhouse gases by at least 20% from 1990 
levels by 2020, the European Commission has 
announced that it will subsidize ‘green’ pro- 
grammes and infrastructures with €105 billion 
(US$136 billion). 

The windfall comes from the union’s 
€347 billion in ‘structural’ funds. It is the 
second-largest budget in the commission's 
portfolio, behind agriculture, 
and is aimed at creating equal 
standards of living across 
the EU. 

Although no fresh money is 
involved — the size of the struc- 
tural funds was agreed in 2007 — the amount 
allocated to green projects is nearly triple that 
of the previous budget period, from 2000 to 
2006. Regions within the EU are free to decide 
how to spend the money, as long as they meet 
the general criteria and priorities set by the 
commission. 

The money is necessary if Europe is to deliver 
on its high hopes for wind and other low- 
carbon energy sources, says Greg Arrowsmith, 
a policy officer with the European Renew- 
able Energy Centers Agency in Brussels. Just 
€2.3 billion of the EU’s €51-billion Seventh 
Framework Programme — Europe's main 


“The massive public 
stimulus will give a 
lot of extra impetus to 
green investment.” 


source of research funding — is earmarked 
for non-nuclear energy technologies. 

The new investment will not cover car- 
bon capture and storage projects, but it does 
address many others related to energy and cli- 
mate. Around €9 billion is targeted for renew- 
able energy and energy-efficiency projects, 
and an additional €29 billion will be invested 
in modernizing railways and in urban trans- 
port projects aimed at reducing emissions 
from cars. Significant fund- 
ing will also be available for 
water and waste-management 
projects, biodiversity and 
nature conservation, and for 
encouraging the development 
of eco-friendly goods and services by small 
and medium-sized enterprises. 

The commission hopes that the investment 
will create new jobs and help to revive regions 
that are suffering in the current economic 
downturn. “Support for the green economy 
and the environment goes hand-in-hand 
with the cohesion policy’s objective to deliver 
sustainable growth, jobs and competitive- 
ness,’ regional policy commissioner Danuta 
Hiibner said when announcing the figures on 
9 March. 

The bulk of the money (€60 billion) will 
go to countries in eastern and southeastern 


Europe that lag behind the rest of the union, 
economically and environmentally. The larg- 
est beneficiaries will be Poland (€18 billion), 
Spain (€12 billion), the Czech Republic (€11 
billion) and Hungary (€9 billion). Money from 
the regional funds could, for example, be used 
to convert old ports on Poland’s Baltic coast 
into transit points for testing and installing off- 
shore wind turbines. It is, however, up to local 
politicians to decide exactly how to apportion 
the funds. 

The plan exceeds the expectations of many 
economists and energy experts, who had 
hoped that the economic crisis would create an 
impetus for ‘greening’ the economy, as has hap- 
pened with massive investments in countries 


B. STRONG/REUTERS 


Society sues journal over right to reply 


The Max Planck Society (MPS) 

in Germany has begun legal 
proceedings against publishers 
Wiley International in a dispute over 
an editorial in the February issue of 
Human Brain Mapping. 

The society alleges that the 
editorial grievously misrepresents 
it and harms the reputation of one 
of its scientists. It wants the journal 
to publish a letter from the society 
addressing these concerns without 
delay. 

Peter Fox, an editor-in-chief at 
Human Brain Mapping, says that the 
MPS letter went through normal 
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refereeing processes “in a timely 
manner", but says he does not know 
when it will be published. MPS 
vice-president Herbert Jackle, a 
developmental biologist who was 
deputized to speak for the society, 
claims that the journal has unfairly 
delayed the society's right to reply. 
The dispute has been raging for 
nearly a year. Fox, a neurologist 
at the University of Texas Health 
Science Center in San Antonio, 
accepted the letter on 11 March, two 
days after the deadline the society 
set before taking legal action. Fox 
says that he will publish the letter 


together with a reply that “rebuts Dr 
Jackle's various accusations”. 

“| don't have a problem with the 
journal printing a reply," says Jackle. 
“But | fear that without expedited 
publication, the letter would appear 
on the web only after months of 
delay and in print only ina year." 

The row originally centred on 
a dispute over who owned data 
gathered in the laboratory of Nikos 
Logothetis at the Max Planck 
Institute for Biological Cybernetics in 
Tubingen. Last spring, Amir Shmuel 
and David Leopold used the data in 
a paper for a special issue of Human 
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Herbert Jackle wants the Max 


Planck Society's letter published. 


Brain Mapping on spontaneous brain 
activity — activity that occurs when 
there is no visual input. Logothetis 


MAX PLANCK SOCIETY 


Wind farms could benefit from greater 
investment by the European Commission. 


such as the United States, Japan, South Korea 
and Germany. 

“This is the way to go,’ says Claudia Kemfert, 
an expert on climate, energy and transport at 
the German Institute for Economic Research 
in Berlin. “This kind of economic stimulus 
makes a lot more sense than bonuses to people 
who give up old cars to buy new vehicles.” A 
€2,500 ‘car-scrapping’ incentive, part of Ger- 
many’s own €50-billion stimulus package, is 
boosting car sales, but has been criticized on 
economic and environmental grounds. 

Oliver Schafer, policy director at the 
Brussels-based European Renewable Energy 
Council, says that the EU’s renewable-energy 
industry — involving some 350,000 jobs, 
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and generating €40 billion a year in business 
volume — will benefit substantially from 
the boost in public funding. Experts have 
estimated that it will take around €600 bil- 
lion to help the EU increase the amount of 
energy it obtains from renewable sources 
from the current 8.5% to its goal of 20% in 
2020. Subsidies through the structural funds 
programme will provide start-up financing 
for new solar, hydro- and wind-power facili- 
ties, for wires needed to transport ‘gree’ 
electricity to consumers, and for decentral- 
ized energy supplies such as through rooftop 
solar-energy collectors. 

But promises of funding injections have hit 
stumbling blocks in the past. The commis- 
sion’s economic recovery plan, proposed last 
November, foresees €3.5 billion in unspent 
money from the past budget period being 
reallocated to investment in energy technolo- 
gies, including €500 million for offshore wind 
projects and €1.5 million for carbon capture 
and sequestration. But several groups in the 
European Parliament, which will vote on the 
issue later this month, are seeking to stop the 
plan. Critics say that few of the envisaged 
projects will materialize soon enough to help 
economic recovery. 

Regardless, the €105 billion from the 
structural funds will still flow. “Sustain- 
able funds have had renewable energies 
and water projects on their radar for quite 
some time,’ says Rolf Hafler, an analyst with 
the rating agency Oekom Research in 
Munich, Germany. “The massive public 
stimulus will give a lot of extra impetus to 
green investment.” a 
Quirin Schiermeier 


editorial on the website of 


says that the data were used 
inappropriately because 

they had been gathered fora 
different purpose, when the 
research monkeys were staring 
at a flickering screen. He wrote 
to the journal's editor, among 
other actions, asking for the 
paper not to be published’. 

It was, however, published 
online? on 8 May 2008. 
Logothetis published 
ascientific rebuttal in 
Neurolmage?, which appeared 
online in January 2009. 

Fox, together with the 
three other editors of the 
special issue, published an 
extended editorial analysing 
the dispute in the framework 


of ethics. This appeared online 
on 9 December 2008 and in 
print in the February issue 
of Human Brain Mapping’. 
The MPS says that there are 
numerous factual errors in the 
editorial's account of events. 
Jackle is also incensed by the 
article's claims that he had 
given permission to publish 
the disputed paper. “I was only 
a mediator ina dispute, not 
an adjudicator as the editorial 
claims," he says. “It is not up 
to the Max Planck Society to 
permit publication of anything 
— that responsibility lies 
solely with the editors.” 

The MPS has posted a 
response to the February 
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Logothetis's laboratory (see 
http://tinyurl.com/c4yq9m). 
“The goal of our editorial 

was to use this conflict to 
discuss the ethical principles 
that govern responsible 
conduct of research and 

peer review,” says Fox, “and 
thereby to develop guidelines 
that would prevent future 
occurrences of this nature.” & 
Alison Abbott 


1. Abbott, A. Nature 454, 6-7 (2008). 

2. Shmuel, A. & Leopold, D. A. Hum. 
Brain. Mapp. 29, 751-761 (2008). 

3. Logothetis, N. K. et al. Neurolmage 
doi:10.1016/j.neuroimage. 
2009.01.010 (2009). 

4. Fox, P. T. et al. Hum. Brain Mapp. 30, 
347-354 (2009). 


Scientists in 
bone battle 


Officials at the University of California are 
moving to give two of the oldest-known 
skeletons in North America to a local Native 
American tribe, against the recommendation 
of university scientists who say the bones 
should be retained for study. 

Under federal law, bones are returned to 
a tribe that can prove ‘cultural affiliation’ 
through artefacts or other analyses. At nearly 
10,000 years old, the skeletons in question 
— unearthed in 1976 at the University of 
California, San Diego (UCSD) — are so 
ancient that they are not culturally linked to 
any tribe (see Nature 455, 1156-1157; 2008). 

But last month, University of California 
president Mark Yudof and UCSD chancellor 
Marye Anne Fox began seeking a rare federal 
approval to give the skeletons to the local 
Kumeyaay tribe, which has asked for them. 
And some anthropologists say the decision is 
based on politics, not science. 

“This is scandalous,’ says Robert Bettinger, 
an anthropologist at the University of 
California, Davis, who is on the panel that 
oversees how archaeological remains are 
handled at all ten University of California 
campuses. The panel was not consulted on 
this transfer proposal. 

“This goes against the policies of President 
Barack Obama for science-based decisions, 
not belief-based ones,” says anthropologist 
Margaret Schoeninger, who chairs the UCSD 
committee that reviews such specimens. The 
final decision is likely to be made by Obama 
appointee Ken Salazar, secretary of the 
Department of the Interior. 

Officials at the Department of the Interior 
say the request will be considered by the 
committee that reviews the return of remains 
and artefacts when it meets in Seattle, 
Washington, on 23-24 May. 

Fox declined an interview, but said in 
a statement that the transfer “seems an 
appropriate balance between the interests of 
science and [those] of the Native American 
community”. 

This week, Schoeninger is petitioning the 
research committee of the UCSD faculty 
senate to try to prevent the transfer. But 
Kumeyaay spokesman Steve Banegas says the 
tribe impatiently awaits the chance to bury 
the bones. “This is long overdue,” he says. ™ 
Rex Dalton 
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America's long hard road 
to climate-change law 


What challenges lie ahead as the United States tries to construct a 
working system for greenhouse-gas regulation? Jeff Tollefson reports. 


“We're back.” That was the message from Todd 
Stern, the United States’ new international 
envoy on climate, to congressional staff, 
scientists and representatives of several coun- 
tries on 3 March in Washington DC. 

In a room overlooking the Capitol dome, 
Stern noted the United States’ responsibility as 
the historical emitter of most greenhouse gases 
globally. He explicitly recognized the right of 
developing nations to lift their people out of 
poverty, and pledged resources to help them 
make the transition to clean development. 

But Stern — the person charged with leading 
US negotiations through the United Nations 
climate-change talks — went on to challenge 
Europe on multiple fronts. Most notably, he sug- 
gested that Europe’ proposal for industrialized 
nations to reduce their emissions by 25-40% 
below 1990 levels by 2020 is unfair, unnecessary 
and “a prescription for stalemate”. 

President Barack Obama has pushed for a 
cap-and-trade system to control greenhouse 
gases in which industries can buy and sell 
permits, auctioned off from the start, in order 
to stay within an overall cap on pollution. It 
is not yet clear whether the United States will 
embrace such a strict regime. But for Obama, 
the place to start is working with Congress. 

Congressional leaders have talked before 
about legislation to create a US cap-and-trade 
system, but the bills usually 
never make it out of com- 


lawmakers. This year, how- 
ever, may be different. With 
the president on board and strong Democratic 
majorities in both congressional houses, 2009 
may be the year when cap-and-trade legisla- 
tion gets done. The question on Capitol Hill is 
not so much if but when Obama will sign such 
a bill into law — and how that law will mesh 
with US negotiations for the next international 
climate agreement. In December, international 
envoys will meet in Copenhagen to hammer 
out a successor to the Kyoto Protocol on Cli- 
mate Change. Stern will represent Obama 
there, and he knows the necessity of working 
with Congress. In 1997, even as he was helping 
negotiate the final details of the Kyoto Proto- 
col, the Senate signalled its discontent with a 
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"This is too important to 
mittee to be voted on by all be left to the president.” 
— Ed Miliband 


symbolic 95-0 vote against the treaty. 

Europe has recently been streaming 
emissaries through Washington DC in an effort 
to build alliances and provide perspectives on 
how to set up a cap-and-trade system. And they 
are actively engaging members of Congress 
and policy-makers instead of just focusing on 
Obama. “This is too important to be left to the 
president,’ says Ed Miliband, the UK Secretary 
of State for Energy and Climate Change. “The 
Congress needs to be an equal partner.” 

Obama has appointed an all-star cast to 
develop a federal strategy on climate and to 
work closely with Congress on the issue (see 
‘The key players’). No one is yet sure how well 
this assembled cast will work to advance cli- 
mate policy. “But there are some very smart 
and very capable people who have gone into 
the administration, who have a deep knowl- 
edge of climate policy and who have all of the 
talent and capability to get this done,’ says Tim 
Profeta, who heads Duke University’s Nicholas 
Institute for Environmental Policy Solutions in 
Durham, North Carolina. 

The first moves from the administration 
might come as early as this month, when 
the Environmental Protection Agency could 
announce its long-delayed response to a 2007 
Supreme Court decision establishing the 
agency’s authority to regulate carbon dioxide 
as a pollutant. Lisa Jackson, 
the agency’s administra- 
tor, is almost certain to issue 
an ‘endangerment’ finding 
declaring CO, a pollutant that 
should be regulated under the 1970 Clean Air 
Act. Technically, such a move would allow the 
Obama administration to move forward with 
regulating carbon on its own. 

But Obama's main goal is to push a bill 
through Congress. The first actions from 
lawmakers could come from the House of 
Representatives. Henry Waxman (Democrat, 
California), chair of the House energy and com- 
merce committee, has said his committee will 
produce a comprehensive energy and climate 
bill by the end of May. House speaker Nancy 
Pelosi (Democrat, California) has promised to 
bring such a bill out of committee and onto the 
full House floor for a vote this year; it would be 
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the first-ever climate vote in the House. 

Alden Meyer, who handles climate policy for 
the Union of Concerned Scientists in Washing- 
ton DC, says that Waxmans approach of putting 
energy and climate into one bill makes sense. 
Like many, Meyer argues that issues such as 
energy efficiency, renewable energy and trans- 
portation are more easily addressed through 
separate regulations than through the cap-and- 
trade programme itself. “That's one of the things 
we've learned,’ says Meyer. “You need other pol- 
icy mechanisms in addition to cap-and-trade” 

It’s not yet clear whether the Senate will take 
that same comprehensive approach. Last June, 
the Senate shot down a bill that would have 
reduced emissions to 63% below 2005 levels by 
2050. Senate majority leader Harry Reid (Demo- 
crat, Nevada) has said he will bring a bill to the 
floor again this year, but there is still debate as to 
how comprehensive it might be — or not. Bar- 
bara Boxer (Democrat, California), who chairs 
the Senate’s environment and public works 
committee, has said she wants her committee 
to produce “simplified legislation”. That could 
well mean a cap-and-trade outline only, leaving 
other committees to handle provisions related to 
things such as energy, taxes and agriculture. 


Balancing act 

Any energy issues would be headed by Jeff Bin- 
gaman (Democrat, New Mexico), who chairs 
the energy and natural resources committee 
in the Senate and who fears that progress on 
renewables might be held hostage by ham- 
mering out the more complicated details of 
cap-and-trade legislation. To overcome such 
blocks, Bingaman is working with a loose 
group of centrists from both parties, often 
dubbed the ‘Gang of 16, members of which 
are searching for ways to make greenhouse- 
gas caps more politically palatable. Their staff 
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Congress is under increasing pressure to pass legislation regulating carbon dioxide emissions. 


are looking at technical solutions that tackle the 
thorniest problems, including how to control 
costs, provide environmental integrity and 
ensure that domestic industries are not put 
at a disadvantage if, say, China fails to act on 
controlling its own emissions. “This whole 
debate often focuses on the 
differences between those who 
advocate clear environmental 
certainty and those who want 
clear economic certainty,’ says 
Jonathan Black, who handles climate for Binga- 
man on the energy committee. 

The same debates have played out interna- 
tionally, with some countries suddenly backped- 
alling on commitments even as the United States 
steps forward. Canadian voters rejected a strong 
environmental platform proposed by the Liberal 
party last autumn, and cap-and-trade legislation 
is floundering in Australia despite support from 
Prime Minister Kevin Rudd. European leaders 
had to make concessions to various industries 
in December 2008 to hold together their coali- 
tion on the next phase of their carbon trading 
scheme, which begins in 2013. 

If it had been successful, last June’s climate 
legislation in the United States would have 
controlled costs by creating an independent 
federal board with the authority to intervene 
if the market gets out of control. Companies 
would also have been able to borrow some 
credits from the future, meaning they could 
emit more now, as long as they emit even less 
in subsequent years. Bingaman favours the 
use of set minimum and maximum prices for 
credits to ensure economic predictability, but 
sucha ‘price collar’ would not guarantee emis- 
sions reductions because more credits could be 
issued if prices were to reach a set level. 


Many lawmakers are also looking at border- 
tax adjustments — essentially tariffs — on 
imports if certain industries find themselves at 
an economic disadvantage as a result of carbon 
regulations. Another method for protecting 
specific industries is to allocate some emis- 

sions permits to them for free, 
which is how European regu- 
lators will handle the issue in 
the next phase of their emis- 
sions trading scheme. 

The challenge of getting Congress to act 
becomes even more difficult if Stern actu- 
ally brings home an international treaty from 
Copenhagen, which requires a two-thirds 
majority (67 votes) to pass in the Senate. “There 
are far less controversial agreements that have 
not managed to attract that level of superma- 
jority here in the Senate,” says Nigel Purvis, a 
former US climate negotiator who works on 
climate issues at the Brookings Institution in 
Washington DC. 

That has Purvis, as well as some in Congress 
and the administration, thinking about ‘con- 
gressional executive agreements. Used for most 
international trade agreements, these have the 
same legal effect as a treaty, but require only 60 
votes to pass in the Senate. In these, Congress 
would authorize the president to negotiate an 
agreement, which is then voted yes or no by 
Congress on its return. 

Kathleen Frangione, chief climate aide to 
Senator John Kerry (Democrat, Massachusetts) 
on the Senate foreign relations committee, says 
that no matter what, the White House will have 
to be fully engaged iflegislation is going to move 
this year. “The question is how much they are 
really willing to fight for this this year,’ she says. 
“And we don't really know yet.” 
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THE KEY PLAYERS 


Leaders in both houses of US 
Congress say they will bring 
climate legislation to a vote this 
year, So congressional staff are 
busy hammering out the details 
of such a bill — particularly a 
cap-and-trade solution to limit 
greenhouse-gas emissions. The 
Obama administration is pushing 
to get such legislation passed 

by lawmakers. The debate’s key 
players are almost all Democrats, 
who expanded their majorities in 
Congress in the 2008 election. 


THE OBAMA 
ADMINISTRATION 
Carol Browner 

Assistant to the president 
on energy and climate 
change 


Lisa Jackson 
Administrator, 
Environmental Protection 
Agency 


HOUSE OF 
REPRESENTATIVES 
Nancy Pelosi 
Democrat, California 
Speaker of the House 


Henry Waxman 
Democrat, California 
Chairman of the House 
committee on energy 
and commerce 


SENATE 

Harry Reid 
Democrat, Nevada 
Majority leader 


Barbara Boxer 
Democrat, California 
Chairwoman of the 
Senate committee on 
environment and public 
works 


Jeff Bingaman 
Democrat, New Mexico 
Chairman of the Senate 
committee on energy 
and natural resources 
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Big interestin heavy drugs 


The drug industry is seeking profits by modifying hydrogen in existing medications. 


Pharmaceutical companies are beginning 
to bet on the idea that simply switching a 
hydrogen atom with a heavier isotope in a cur- 
rently approved drug could create a better drug. 
Encouraged by results from clinical trials, com- 
panies are snapping up intellectual-property 
rights on many of the modified drugs. 

On 16 March, for example, Concert 
Pharmaceuticals of Lexington, Massachu- 
setts, reported results of a phase I clinical 
trial for a version of the antidepressant par- 
oxetine, sold as Seroxat by GlaxoSmithKline 
and first marketed under that trade name 
in 1992. Concert’s version swaps out one or 
more of the hydrogen atoms in the paroxet- 
ine structure (see ‘Drug modifications’) for 
deuterium, a heavier isotope of hydrogen that 
contains a proton and a neutron, rather than 
just a proton. The company was testing this 
deuterated version for treating hot flushes 
without the side effect of standard paroxetine 
in which the liver enzyme CYP2D6 is inacti- 
vated. Because this enzyme metabolizes many 
other drugs, inactivating it means it is hard to 
take other drugs along with paroxetine. 

In Concert’s trial, 94 women tested paroxet- 
ine — both standard and deuterated versions 
— along with the cough medicine dextrometh- 
orphan, which is metabolized by CYP2D6. 
Taken with standard paroxetine, metabolic 
uptake of the cough medicine is inhibited, 
says Roger Tung, chief executive at Concert. 
However, the deuterated version showed less 
metabolic inhibition — suggesting that the 
drug might be better to combine with others. 

In theory, deuterated drugs can work differ- 
ently in the body because deuterium can make 
stronger chemical bonds than hydrogen. This 
can affect how quickly a drug is broken down. 

Another company working with deuteration 


DRUG MODIFICATIONS 


Pharmaceutical companies have replaced 
hydrogen atoms with deuterium to create 
versions of paroxetine and venlafaxine to test 
in clinical trials. 


H3CO 


Paroxetine Venlafaxine 


Substituting atoms in drugs could allow patenting opportunities and create better pharmaceuticals. 


is Auspex Pharmaceuticals, based in Vista, 
California. Last October, Auspex announced 
phase I clinical trial results with a deuterated 
version of venlafaxine. Venlafaxine was first pro- 
duced by Wyeth, in 1993, as the antidepressant 
Effexor. The trial suggested that the deuterated 
version of the drug stayed in the bloodstream 
longer than the non-deuterated version and may 
even cause fewer side effects, says Mike Grey, 
chief executive of Auspex. 

The company already has a patent on its 
deuterated version of venlafaxine, and has filed 
more than 150 other US patents on families of 
deuterated drugs. Concert has filed more than 
100 US patent applications and has received 
two notices of allowance already, and is expect- 
ing to get its first patents this year. 


Beyond the obvious 

Kevin Mooney, a patent lawyer with Simmons 
& Simmons in London, says the strategy of pil- 
ing up patent applications on deuterated ver- 
sions of existing drugs is legitimate. “Everyone 
is entitled to research on other compounds,” 
he says. Still, the onus will be on companies to 
demonstrate the usual criteria for patentabil- 
ity. “They would have to show that this deu- 
terated form of paroxetine was new,’ he says, 
“then they would have to show that it wasn’t 
an obvious thing to do?’ Patent applications — 
for example, those involving different salts of 
known compounds — often fail on this second 
requirement of non-obviousness. 

Tung is confident that his company’s tactics 
will work. “We treat this as an entirely new 
chemical entity,’ he says. 

But in future, getting such patents will be 
harder, says Kirk Gallagher, a pharmaceutical 
patent lawyer at D Young and Co in London. 
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“The obviousness bar will be raised as time 
goes by,’ he says, as the idea of deuterating a 
drug to get different pharmacokinetic proper- 
ties becomes commonplace. Rather than the 
broad patents covering families of drugs being 
applied for now, companies will have to show 
improved action of specific molecules to con- 
vince a patent-giver that they have something 
new, Gallagher says. “They'll have to do more 
science before they file their patents.” 

Derek Lowe, a drug-discovery scientist and 
author of the In the Pipeline blog, says larger 
drug companies may take the same approach. 
“T think that every big pharma company is 
keeping this in mind,’ he says. “That’s where 
the start-ups are going to have a problem” 

The approach with deuterated drugs is 
similar to that used by Sepracor, based in 
Marlborough, Massachusetts, in the 1980s. 
Until that point, patents didn’t include 
specific information on isomeric forms of 
drugs. Sepracor made its business by filing 
patents on active isomers of known drugs. In 
response, pharmaceutical companies began 
to routinely specify chiral isomers on patents. 
“Pharma companies will now probably add 
deuterium analogues,” says Gallagher. 

Grey isn't worried. “There is still a vast 
landscape for us to explore,” he says. Auspex 
hopes to partner with larger drug companies 
to continue developing its products. 

Concert is similarly looking for ways for- 
ward. The company has raised US$96 million 
in start-up money so far, but has decided not 
to develop its deuterated paroxetine further for 
now — focusing instead on another deuterated 
drug, an HIV protease inhibitor, which it thinks 
will be a bigger seller. a 
Katharine Sanderson 
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Roche vows to keep Genentech culture 


Anera has ended for the biotechnology industry 
with the full merger of biotech pioneer 
Genentech, of South San Francisco, and Swiss 
pharmaceutical giant Roche, based in Basel. 

The deal, announced on 12 March, ends 
Roche’s eight-month pursuit of Genentech. 
Roche’ final offer of about $46.8 billion — $95 
a share — will buy it the 44% or so of Genentech 
that it doesn’t already own. The merger is the 
latest of several mega-mergers rippling through 
the drug industry (see ‘Getting it together’). 

But the Roche-Genentech deal has a unique 
resonance in the science and biotech commu- 
nities, because Genentech’s creative culture 
and rigorous science have been seen as essen- 
tial to its ongoing success in producing novel 
medicines. And although Roche has vowed to 
preserve Genentech’ entrepreneurial scientific 
culture, observers say that this may not be pos- 
sible under Swiss leadership. 

“Tt just seems unlikely that Genentech can 
maintain its creativity, because the difference 
between the two corporate cultures is not a 
superficial one,” says Laurence Lasky, who 
worked at Genentech from 1982 to 2002 and 
was a Genentech fellow — the company’s top 
science rank — when he left. “It’s sad, because 
Genentech really is one of the greatest home- 
grown things that we have created in this coun- 
try,’ says Lasky, who is now a partner at US 
Venture Partners of Menlo Park, California. 

Ina statement on the deal, the two companies 
said that Genentech’s “unique culture [will 
be] maintained”. Company executives are not 
speaking to the press about how exactly this 
will be accomplished. But former Genentech 
employees say that similar fears surfaced in 
1990, when Roche bought a majority stake in 
Genentech, and proved to be unfounded. “That 
partnership didn't hurt our creativity; Roche 
stayed at a distance and Genentech continued 
to do great work,’ says David Goeddel, who 
was the first scientist hired at Genentech and is 
now a managing partner at the venture-capital 
firm the Column Group in San Francisco. 

But, Goeddel adds, “I think the complete loss 
of any independence for Genentech is kind of 
sad after so many years”. 

Founded in 1976, Genentech grew from a 
tiny start-up to an almost $100-billion com- 
pany by capitalizing on pioneering science. For 
instance, Genentech scientists were the first to 
engineer bacteria to produce human proteins, 
such as insulin, which became the company’s 
first drug — and the world’s first genetically 
engineered drug — in 1982. 

Since then, Genentech has developed a 
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Genentech's reputation for scientific creativity is often credited with its success in drug development. 


pipeline of biological drugs worth $9.5 billion 
in annual sales in the United States alone. Three 
of these are now Roche's best sellers. One, the 
cancer medicine bevacizumab, sold as Avastin, 
is already worth $4.8 billion a year and is cur- 
rently in clinical tests, due to report results in 
about a month, that may expand its uses. 
Genentech’s story has served as a model for 
the rest of the biotech industry, and young com- 
panies often speak of aspiring to become the 
next Genentech. But, Goeddel says, “that’s pretty 
impossible to do now” Panicked by their own 
lack of promising drug candidates, big pharma- 
ceutical companies are picking off small bio- 
techs at a much earlier stage than they did in the 


Getting it together 


2009 is shaping up to be the year of the 
pharmaceutical mega-merger. Analysts 

say that such mergers are likely to be only a 
short-term fix for companies’ dwindling drug 
pipelines. Among the recent highlights: 
January 2009: Pfizer acquires Wyeth for 
$68 billion, in part to make up for the fact 
that its blockbuster cholesterol drug Lipitor 
comes off patent in 2011. 

March 2009: Merck buys Schering- 

Plough for $41 billion, in part to acquire the 
company's pipeline of drugs in the late stages 
of development. 

March 2009: Roche, which already owned 

a majority stake in Genentech, buys out the 
rest of the company for $47 billion. 
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past, preventing the companies from evolving 
independently. Some of the other early biotech 
companies have survived as a result of mergers 
and acquisitions: Biogen, based in Cambridge, 
Massachusetts, merged with Idec of San Diego, 
California, in 2003, and Amgen, headquartered 
in Thousand Oaks, California, has acquired sev- 
eral smaller biotechs over the years. 

Roche is thought to have been spurred to 
solidify its hold over Genentech because its 
exclusive rights to sell Genentech products 
outside the United States were set to expire in 
2015. Roche first bid for Genentech last July, 
offering $44 billion, or $89 per share, for the 
rest of the company. 

Now that the two firms have agreed ona final 
number, the merged company is set to become 
the seventh largest pharmaceutical firm in the 
United States, with an annual revenue of about 
$17 billion. It will retain the Genentech brand 
in the United States and will be headquartered 
at Genentech’s South San Francisco campus. 

But the campus may see an exodus of Genen- 
tech employees, possibly including chairman 
and chief executive Arthur Levinson, who joined 
the company in 1980. “My guess is it would be 
very hard for [Levinson] to stay, because he’s 
used to running his own show,’ Lasky says. 
Levinson is not speaking to reporters, but said 
in a statement: “We look forward to working 
with our partners at Roche to ensure a smooth 
transition once the transaction is complete and 
to continue our mission of serving patients.” ™ 
Erika Check Hayden 
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Incoming chief to tackle woes 
of US food and drug agency 


Physician Margaret Hamburg, once the 
youngest-ever health commissioner for New 
York City, was nominated on 14 March by 
President Barack Obama to head the beleagu- 
ered US Food and Drug Administration (FDA). 
If confirmed by the Senate, she would bring to 
the commissioner's job a background of public- 
health experience that runs from working at 
the National Institute of Allergy and Infectious 
Diseases to advising the Centers for Disease 
Control and Prevention, and from bioterror- 
ism planning to pandemic-flu preparation and 
tuberculosis control. 

The president also named Joshua Sharfstein, 
a paediatrician who is the health commissioner 
in Baltimore, Maryland, to be Hamburg’s prin- 
cipal deputy commissioner. His appointment 
does not require Senate confirmation. 

The $2.6-billion agency, still regarded as a 
global gold standard for drug regulation, has 
seen its image tarnished and its morale sapped 
in the past decade by a slew of regulatory failures 
related to both food and drugs (see Nature 457, 
770-771; 2009). Numerous reports by govern- 
ment and scientific bodies have documented 
the erosion of the FDAs scientific and oversight 
capabilities, even as it faces burgeoning respon- 
sibilities. And episodes such as its delay in mak- 
ing the emergency contraceptive Plan B available 
over the counter led to charges of undue political 
influence on its decision-making under former 
commissioner Andrew von Eschenbach. 

Obama, speaking in his weekly video 
address, said the FDA has been “underfunded 
and understaffed”, and announced the forma- 
tion of a Food Safety Working Group of top 
government officials. 

People familiar with Hamburg and Sharf- 
stein said last week that although the two 
are not bench scientists, they would bring a 
science-based approach to the FDA. 

Hamburg “has a great appreciation that public 
policy must be driven by the very best science’, 
says Gail Cassell, a microbiologist and vice- 
president of scientific affairs at drug company 
Eli Lilly, based in Indianapolis, Indiana. For 
instance, as health commissioner in New York 
in 1991, Hamburg campaigned vigorously for 
— and achieved — the establishment of a then- 
controversial needle-exchange programme to 
slow the transmission of HIV/AIDS. She also 
vocally opposed a requirement that AIDS edu- 
cators stress sexual abstinence to city students. 

Ina rare move, Republican mayor Rudolph 
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Margaret ‘Peggy’ Hamburg will lead the FDA. 


Giuliani asked Hamburg to stay on after he took 
over from Democrat David Dinkins, who had 
hired her. During her tenure, she boosted child- 
hood immunization rates to record levels while 
dramatically lowering tuberculosis incidence 
by implementing a novel programme later cited 
as a model by the World Health Organization. 
She also created the country’s first modern 
bioterrorism preparedness programme. 

Hamburg left New York to become a lead 
strategist for the Department of Health and 
Human Services under then-president Bill 
Clinton, where she was in charge of planning 
for bioterrorism, among other things. “She 
was definitely ahead of the curve” in seeing 
the need for bioterrorism preparedness, says 
Thomas Inglesby, deputy director of the Center 
for Biosecurity of the University of Pittsburgh 
Medical Center in Pennsylvania. Most recently, 
she has served as a senior scientist at the Nuclear 
Threat Initiative in Washington DC, where she 
developed a strategic plan and grant-making 
portfolio addressing biological threats. 

Sharfstein, a former staffer for liberal Con- 
gressman Henry Waxman (Democrat, Cali- 
fornia), has a history of confronting the drug 
industry. In 1997, as a medical resident, he 
wrote to The New England Journal of Medicine 
complaining that a Pfizer-sponsored event 
promoting its products to young doctors in 
Boston, Massachusetts, offered free pool and 
alcohol, urging them to “rack em up & toss em 
down”. (Ina response, the company admitted 
that its slogan “created a poor impression”) 

In 2007, as health commissioner in Balti- 
more, Sharfstein successfully petitioned the 
FDA to oppose the use of over-the-counter 
cough and cold medicines in children under 
two years old. a 
Meredith Wadman 
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Yale climate-change 
institute launched 


Rajendra Pachauri, chairman of the 
Intergovernmental Panel on Climate 
Change (IPCC), will head a new climate and 
energy research institute 
at Yale University. 

The institute and its 
head were announced 
on 10 March at an 
international climate- 
change conference 

in Copenhagen (see 
‘Soundbites from 
Copenhager). 

Pachauri, Nature’s 
Newsmaker of the Year 
in 2007, will work part- 
time at the Yale institute, 
retaining his current positions with the IPCC 
and as director-general of India’s Energy and 
Resources Institute in New Delhi. 


Rajendra Pachauri. 


NIH research to become 
permanently open access 


The annually renewable public-access 
policy first implemented by the US National 
Institutes of Health (NIH) a year ago was 


made permanent in a spending bill signed 
into law on 11 March. The provision 
specifies that all NIH-funded investigators 
must submit an electronic version of their 
peer-reviewed manuscripts to the National 
Library of Medicine’s PubMed Central. The 
manuscripts must be made publicly available 
no later than 12 months after publication. 
The new law specifies that it must be 
implemented in a manner consistent with 
copyright law. Congressman John Conyers 
(Democrat, Michigan) is hoping to use that 
clause to upend the policy. His Fair Copyright 


in Research Works Act would amend US 
copyright law to forbid federal agencies from 
making funding agreements, such as NIH 
grants, conditional on authors making copies 


of their peer-reviewed articles freely available. 


Maths mistake blamed for 
killing off ‘clean coal’ plant 


The huge cost overruns cited as a reason 
for cancelling the US government's flagship 
carbon-sequestration plant, FutureGen, 


SOUNDBITES FROM COPENHAGEN 


Views from the International Scientific Congress on Climate Change, 10-12 March. 


“On any kind of pragmatic timescale, | 
think we should see loss of the Amazon 
forest as irreversible.” 

Chris Jones Met Office Hadley Centre, Exeter, UK 


“I'm frustrated, as are many of my 
colleagues, that 30 years after the US 
National Academy of Sciences issued 
a strong warning on CO, warming, the 
full urgency of this problem hasn't 
dawned on politicians and the general 
public.” 


Stefan Rahmstorf Potsdam Institute for Climate 
Impact Research, Germany 


“Ice loss in Greenland has accelerated 
over the last decade. The upper range of 
sea level rise by 2100 might be 1 metre 
or more.” 

Konrad Steffen University of Colorado, Boulder 


“Do the politicians understand just how 
difficult it could be, just how devastating 
4,5, 6 degrees centigrade would be? | 
think, not yet.” 

Nicholas Stern former UK government economics 
adviser 


For coverage of the conference, see http://tinyurl. 
com/dm9plg and http://tinyurl.com/daroma 
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were due in part to a mathematical error, a 
congressional report has found. 

Announced by President George W. Bush 
in 2003, the plant was to have captured and 
stored 90% of its carbon dioxide emissions. 
It was originally slated to cost $950 million, 
but was cancelled in early 2008 when new 
estimates pegged its cost at $1.8 billion (see 
Nature 451, 612-613; 2008). 

But those estimates were not comparable, 
the Government Accountability Office 
report says. The $950-million estimate was 
done in constant dollars; the $1.8-billion 
figure included expected increases for 
inflation. The 2008 figure would have been 
$1.3 billion if adjusted for constant dollars. 

Steven Chu, the new energy secretary, said 
last week that he plans to meet soon with 
private partners involved in the project to 
discuss whether and how to take it forward. 


Lighter Higgs boson 
harder to find 


Physicists have placed fresh limits on the 
mass of the Higgs boson — the particle 
thought to confer mass on other matter. 
The particle was assumed to have an 
energy (or equivalent mass) of between 
114 and 185 gigaelectronvolts (GeV). But, 
after analysing pooled data, researchers 


Fermilab's finding may cause problems for CERN. 


at the Tevatron particle accelerator at the 
Fermi National Accelerator Laboratory in 
Batavia, Illinois, announced on 13 March 
that the particle can be excluded at energies 
of between 160 and 170 GeV, and is likely to 
exist at the lower end of the assumed range. 
At more-crowded lower energies, filtering 
out other debris and finding a rare Higgs 
event becomes more difficult. Scientists 
at the Large Hadron Collider at CERN, 
Europe's particle-physics laboratory near 


NEWS IN BRIEF 


Geneva, had hoped to find the Higgs boson 
in the higher-energy region. 

For a longer version of this story, 

see http://tinyurl.com/cp8avk. 


Hospital investigation 
reveals long-running fraud 


In what could be one of the largest ever cases 
of medical-research fraud, a prominent 
anaesthesiologist has been accused of 
fabricating data in at least 21 papers over 

13 years. 

Scott Reuben fabricated all or some 
of the data in studies as far back as 1996, 
according to an internal investigation by 
Baystate Medical Center in Springfield, 
Massachusetts, where he worked. Journals 
such as Anesthesiology and Anesthesia and 
Analgesia have already retracted Reuben’s 
papers, which supported combining 
painkillers called COX-2 inhibitors (such 
as Merck’s Vioxx and Pfizer’s Celebrex) with 
other analgesics, including Pfizer’s Lyrica, 
to relieve pain after surgery. 

An attorney representing Reuben said that 
the Baystate investigation was confidential, 
that there were extenuating circumstances 
and that Reuben deeply regrets all that has 
happened. Reuben is barred from research at 
Baystate for at least a decade. 
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Science journalism is in 
decline; science blogging 
is growing fast. But can the 


one replace the other, asks 
Geoff Brumfiel. 


ohn Timmer’s slide into journalism was 
so gradual even he can’t put his finger 
on the point at which he stopped being 
a researcher. 

He started reading Internet websites and 
message boards a decade ago, while he was 
working as a postdoc in a developmental neuro- 
biology lab at the Memorial Sloan-Kettering 
Cancer Center in New York. One day, one of his 
favourite sites, Ars Technica, announced that 
it was looking for someone to help with its sci- 
ence coverage. It was 2005, and a school board 
in Dover, Pennsylvania, had gone to court over 
the promotion of intelligent design. “I thought, 
wow, it really feels like the public has completely 
lost touch with what science is all about, says 
Timmer. “So I basically e-mailed the existing 
author and volunteered.” 

Over the next few years Timmer’s work 
on the site grew steadily, while his research 
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career stalled. Today the 42-year-old draws a 
full-time salary as Ars Technica’s science edi- 
tor. He works with writers echoing his earlier 
experience: graduate students and postdocs 
type up brief summaries on research in their 
areas of expertise during down time and lunch 
breaks. The write-ups are more technical than 
you might read in a newspaper — a recent post 
included a lengthy discussion on ‘functional- 
izing’ cells to bind them together with DNA 
— but that’s fine, Timmer says. The idea is to 
provide people already interested in science 
with greater insight into how research works. 
A typical posting can earn a writer anywhere 
from the price of a pair of movie tickets to 
around US$100, and that is often incentive 
enough for young academics. 

Timmer’s tale is emblematic ofa shift in the 
way science meets the media. In part because 
of a generalized downturn, especially in 
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newspaper revenues, the traditional media are 
shedding full-time science journalists along 
with various other specialist and indeed gen- 
eralist reporters. A Nature survey of 493 sci- 
ence journalists shows that jobs are being lost 
and the workloads of those who remain are on 
the rise (for full results see http://tinyurl.com/ 
c38kp6). At the same time, researcher-run 
blogs and websites are growing apace in both 
number and readership. Some are labours of 
love; others are subsidized philanthropically, 
or trying to run as businesses. 


It's a blog world 

Traditional journalists are increasingly looking 
to such sites to find story ideas (see ‘Rise of the 
blogs; page 276). At the same time, they rely 
heavily on the public-relations departments 
of scientific organizations. As newspapers 
employ fewer people with science-writing 
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backgrounds, these press offices are employing 
more. Whether directly or indirectly, scien- 
tists and the institutions at which they work 
are having more influence than ever over what 
the public reads about their work. 

The amount of material being made available 
to the public by scientists and their institutions 
means that “from the pure standpoint of com- 
municating science to the general public, we're 
ina kind of golden age’, says Robert Lee Hotz, 
a science journalist for The Wall Street Journal. 
But that pure standpoint is not, or should not 
be, all that there is to media coverage of science. 
Hotz doubts that blogs can fulfil the additional 
roles of watchdog and critic that the traditional 
media at their best aim to fulfil. That sort of 
work seems to be on its way out. “Independ- 
ent science coverage is not just endangered, it’s 
dying,” he says (see “Vox media, page 277). 

What’s more, the amount of material 
available is not a good proxy for its reach. Press 
releases and blogs will not find the same broad 


audience once served by 
© the mass media, says 
) Peter Dykstra, who was 
executive producer of 
CNN’s science, technol- 
ogy, environment and 
weather unit until it was 
closed down last year. 
Now at the Woodrow 
Wilson International 
Center for Scholars, 
an independent think 
tank in Washington DC, 
he says that science and 
environment news will be 
“ghettoized and available 
only to those who choose to 
seek it out”. 
Science journalism boomed 
in the 1980s and early 1990s. In 
the United States — where by 
1989 some 95 newspapers had 
dedicated science sections — and 
elsewhere, the field’s precipitous 
rise was supported by buoyant prof- 
its in the media sector. “The model of 
Va a major paper was that they did really 
serious science coverage,’ says Deborah 
Blum, who won a 1992 Pulitzer Prize for 
her reporting in the Sacramento Bee on 
the use of animals in research, and who now 
teaches at the University of Wisconsin at Madi- 
son. But there was a problem with the science 
sections, she says. “They didn’t make money.” 
Most papers were willing to support their 
sections, even at a loss, because science was 
the thing to have. Today, in a harsher mass- 
media landscape, that has changed. Across the 
United States, newspaper science sections have 
been shut down: this month The Boston Globe 
stopped running its weekly science and health 
section. Nor is the written word the only casu- 
alty, as the closure of Dykstra’s seven-person 
unit at CNN indicates. Nature’s survey shows 
that, of those working in the United States and 
Canada, one in three had seen staffing cuts at 
their organization (see ‘Hiring practices’). 
The European industry has not yet reached 
the level of crisis seen in the United States, says 
Holger Wormer, a professor of science journal- 
ism at the University of Dortmund in Germany. 
Many newspapers in Germany are considering 
staff cuts but, at the moment, science journalists 
are faring relatively well. “Science departments 
are still small but they are regarded as quite 
important,” he says. Because larger German 
papers such as Frankfurter Allgemeine Zeitung 
have science sections, smaller papers are willing 
to support their own science coverage, at least 
for now. In France, declining circulations are 
also creating problems, according to Stéphane 
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Foucart, a science writer at Le Monde. In the 
past six months, Le Monde has scaled back its 
science coverage. Newspapers and broadcast 
outlets in the United Kingdom are also under 
pressure, and science and environmental jobs 
are among those that have been lost. 

Unsurprisingly, among the science report- 
ers who remain, the workload is on the rise. 
Nature’s survey reveals that 59% of journalists 
have seen the number of items they work on in 
a given week increase over the past five years. 
They are not just doing more reporting, but 
more types of reporting. Many are now being 
asked to provide content for blogs, web stories 
and podcasts — something they weren't doing 
five years ago. 


Fast and dirty 
Under these straitened conditions the main- 
stream media’s need for quick and accurate 
science content is being met primarily by 
public-relations departments, according to 
Fiona Fox, director of the Science Media 
Centre, an organization in London that sup- 
plies journalists with scientific information 
(Nature’s editor-in-chief, Philip Campbell, 
sits on the Science Media Centre’s board, and 
the Nature Publishing Group provides support 
for it). Mark Henderson, science editor for The 
Times, based in London, says that he tries to 
avoid relying solely on releases “as much as 
possible’, but “if there’s a good press release 
and you've got four stories to write in a day, 
youre going to take that short cut”. Nature’s 
survey shows press releases to be a top source 
of story ideas for science journalists, with 39% 
routinely quoting from them directly. 

This demand for stories and ideas has been 
matched by an increase in supply. In Britain as 


HIRING PRACTICES 
Many North American science journalists 
report job losses in the past five years. 
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in the United States, contraction in the media 
has made jobs in public relations particu- 
larly attractive for students at science-writing 
programmes. “Youd be amazed at the diver- 
sity of places for science communicators,’ says 
Blum. Government agencies, universities, 
museums and non-governmental organiza- 
tions have all hired her students, she says — 
almost all of whom are finding jobs, despite 
the woes of the traditional media. 

The Science Media Centre demonstrates the 
new opportunities that exist now. It was started 
in 2002 by an amalgam of non-commercial and 
commercial interests seeking to influence the 
public debate on news topics such as geneti- 
cally modified foods. What began as a rela- 
tively modest attempt to connect journalists 
to sources of scientific expertise has expanded 
dramatically over the past seven years. Today, 
the centre’s six-person staff sends out daily 
e-mails filled with quotes from prominent 
scientists on the latest news that end up in 
tomorrow’ stories. It has also begun provid- 
ing fact boxes and background documents that 
journalists can insert directly into their cov- 
erage. Fox is happy at the centre's success, but 
uneasy too. Ideally, she says, science journalists 
should be picking up the phone and talking to 
scientists directly: “We are successful because 
of a serious problem in journalism, and it’s not 
one to be celebrated.” 


Straight to the masses 

As journalists become more dependent on sci- 
entific public relations, scientists themselves 
have begun reaching out to mass audiences 
through the Internet. Such outreach is not new; 
but unlike books and lectures, science blogs 
operate with a quick turnaround that more 
closely resembles that of the traditional media. 
The most successful sites are drawing hundreds 
of thousands of visitors each month. 
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Many of those blogs were started by scien- 
tists who simply wanted to reach the public 
with information about their research. “I'd 
always find that people were interested in 
what I did,” says Derek Lowe, a researcher with 
Vertex Pharmaceuticals in Cambridge, Mas- 
sachusetts, and author of In the Pipeline, a blog 
about drug discovery and the pharmaceutical 
industry. “Most people have no idea how drugs 
are actually found,” he says. Lowe started his 
blog in early 2002, and now it regularly draws 
around 200,000 page views a week. 

Paul Myers, a biologist at the University of 
Minnesota in Morris, says that he started his 
blog Pharyngula “largely out of boredom’, 
but now that he gets more than half-a-million 
weekly page views, he sees it as a valuable tool 
for talking to a public audience. Myers freely 
admits that his readers 
“are not just there for the 
science” — his attacks on 
religion are a mainstay of 
the blog’s appeal. But he 
certainly considers him- 
self a source of scientifi- 
cally reliable information for his readers. 

Although science blogging did not start off 
as a business, there are attempts to make it one. 
Since 2006, the publisher of Seed, a magazine 
about science, has gathered more than 100 
science blogs — including Pharyngula — on 
a range of topics on to a single website, Sci- 
enceBlogs, and now pays its bloggers on the 
basis of how many hits their posts receive. 
Fabien Savenay, a senior vice-president for 
marketing at Seed Media Group in New York, 
declines to say whether the blog site makes 
money for the organization. But, he says, the 
project “has been a successful franchise for us 
in that it has great traffic and engagement”. 
Another US magazine, Discover, has recently 
been amassing a smaller but impressive stable 
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“It feels like the public has 
completely lost touch with 


what science is all about.” 
— John Timmer 


of bloggers, too. Other 
magazines, such as WIRED, 
prefer a more journalistic approach to 
blogging, using a team of reporters on their 
science blog to provide a pace, range and qual- 
ity of posting no individual could match. 
Bloggers with a science background, 
like bloggers on most other topics, often 
demonstrate open scorn for the main- 
stream media (MSM in blogspeak). 
“You get a press release that is slightly 
rehashed by somebody in the newsroom and 
it goes in the paper! It’s wrong, its sensation- 
alist, it erodes the public trust in scientific 
endeavour,” says Bora Zivkovic, author of A 
Blog Around the Clock on ScienceBlogs and 
an online community manager for the Pub- 
lic Library of Science journals. Myers takes a 
similar view. “Newspapers realize that they can 
get their audience by peddling crap instead of 
real science,’ he says. Not surprisingly, those 
who came to blogging from journalism — 
such as Carl Zimmer, who writes for a range of 
publications, including The New York Times, 
and blogs at Discover — tend to disagree. But 
Larry Moran, a biochemistry professor at the 
University of Toronto, Ontario, who blogs at 
Sandwalk, seemed to speak for many bloggers 
when he recently wrote 
“Most of what passes for 
science journalism is so 
bad we will be better of 
[sic] without it”. 

While journalists such 
as Zimmer expand their 
mainstream work into their blogs, bloggers 
with roots in the lab are moving into print. 
Myers will soon contribute a regular column 
to the Guardian newspaper in the United 
Kingdom. Derek Lowe now writes regular 
columns for The Atlantic and the trade maga- 
zine Chemistry World (both have also written 
for Nature). This work, though, tends towards 
opinion and analysis, not reporting. “Bloggers 
don't want to be journalists,’ says Zivkovic. 
“I want to write on my blog whatever I want. 
I may write a post about a new circadian paper, 
but the next eighty posts are about politics or 
what | ate for breakfast.” Despite his distaste 
for how the trade is practised, he thinks that 
there will always be a need for professional 
journalists covering science. “Somebody has 


Vox media 


More than 100 science journalists 
responding to Nature's survey offered their 
thoughts on the future of the field. Here's a 
sample of what they had to say: 


“Science journalism is dying in the mass 
media. It has always been a niche subject, but 
only those really interested in it will continue 
to purchase specialist science media. Print 
publications will become more niche but will 
survive. TV news and documentaries will 
become dumbed down in order to compete 
with the idiocy on the Internet.” 


“The public remains interested in science. 
They pack science fairs and museums; they 
buy popular science books; they watch TV 
documentaries. But I'm not sure the public's 
appetite for science is so great that people 
need daily science news. So when this or that 
media outlet cuts its science desk, it could be 
in response to what they can now measure 
on their websites: which topics really engage 
the public day to day.” 


“lama scientist who is freelancing 
occasionally for a science popularization 
magazine published by my institute. Most 
of the time, the description of the scientific 
result in a press release is so dumbed 

down that | cannot find out what the result 
actually was in the terms of an expert! 
Instead of dumbing down the science to 
the level of the general public, we should be 
trying to educate the public. “ 


“It has been shocking to see the public come 
to view science news as a bulk commodity. 
Readers seem to make little or no distinction 
between professionally written reports 

from independent news organizations and 
promotional writing masquerading as news on 
various blogs and science ‘news’ websites.” 


“Commercial pressures are polluting science 
journalism. The mainstream media has 
pitifully low standards of science journalism 
where the herd mentality prevails. There is 

a prevailing view among newspaper editors 
that science does not deserve as much 
coverage as other fields, founded probably 
on nothing other than these editors personal 
chip on their shoulder regarding their own 
scientific education.” 


“I'd love to know if the monks were wringing 
their hands over the horrible shallowness of 
thought sure to follow the invention of those 
funny little letter bits squashed on paper with 
apress.” 


G.B. 


to actually be 

paid to write 

about things as they 
come out,” he says. 

That is what John Timmer is look- 
ing for new ways to do at Ars Technica. But 
there is a problem: the online world, both in 
its bloggier reaches and elsewhere, is polarized; 
people go to places they 
feel comfortable. Many 
of the people that Tim- 
mer originally hoped to 
reach when writing about 
intelligent design and the 
Dover trial probably go 
elsewhere for their news, 
he says, because “it’s easy 
for somebody to pick their news sources based 
on their politics, and get that version of sci- 
entific issues”. Dykstra worries that in a more 
fragmented media world, “environmental news 
will be available to environmentalists and sci- 
ence news will be available to scientists. Few 
beyond that will pay attention.” 

Others worry about the less questioning 
approach that comes with a stress on commu- 
nication rather than journalism. “Science is like 
any other enterprise,” says Blum. “It’s human, 
it’s flawed, it’s filled with politics and ego. You 
need journalists, theoretically, to check those 
kinds of things,” she says. In the United States, 
at least, the newspaper, the traditional home of 
investigations and critical reporting, is on its 
way out, says Hotz. “What we need is to invent 
new sources of independently certified fact.” 


Culture mash 
Two Ivy League giants, Princeton University in 
New Jersey and Yale University, are trying to 
do something about the problems they see in 
environmental coverage with websites aimed 
at generating scientifically accurate news cov- 
erage. “We're bringing something new to the 
table,’ says Roger Cohn, a veteran journal- 
ist who now edits the Yale Environment 
360 website, which is funded in part by 
the William and Flora Hewlett Founda- 
tion and the John D. and Catherine T. 
MacArthur Foundation. The site is 
home to reports by journalists and 
opinions by scientists on subjects 
such as climate change, but it has 
“no axe to grind on any one of these 
issues’, says Cohn. 

At the Princeton University website, 
Climate Central, the focus is mainly on 
video material. “We're just in the initial 
stages of preparing a weekly series of 
news stories about climate based on 
papers in journals,” says Michael 
Lemonick, a long-time science “# 
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"Science is just like any 
other enterprise. It's 
human, it's flawed, it's filled 


with politics and ego.” 
— Deborah Blum 
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writer for Time magazine who now works at the 
site. As well as appearing on Climate Central, 
he says, the stories will be offered to the web- 
sites of big media outlets; some of the group’s 
work has already been aired on the Public 
Broadcasting Service's evening news show The 
NewsHour with Jim Lehrer, which reaches mil- 
lions of viewers. Climate Central is funded by 
the Flora Family Founda- 
tion and The 11th Hour 
Project, a non-profit 
organization supporting 
climate awareness, based 
in Palo Alto, California. 

Lemonick says his new 
job requires him to listen 
more closely to research- 
ers. “If they say, ‘you really left out this important 
fact; I don't get to say, ‘Sorry it’s my story,” he 
says. That doesnt mean that researchers make 
his story into a dry scientific paper, he adds. 
“They have to recognize the needs of the jour- 
nalist, but we have to recognize the needs of the 
scientists. We're kind of fusing the two cultures.” 
Timmer’s path has also led him to a fusion of 
science and journalism. In May, media giant 
Condé Nast acquired Ars Technica, and he was 
brought on full-time. “When I’m interacting 
with press officers or researchers, I’m acting asa 
journalist, he says. “I don't think anybody would 
consider me a working scientist any more.’ But 
when asked how he sees the scientists writing for 
him, he becomes more philosophical: “Basically, 
however they see themselves.” a 
Geoff Brumfiel is a senior reporter in Nature's 
London office. 


See Editorial, page 260. 
Full survey data accompany this article online. 
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Archived answers 


The switch to electronic medical records opens up a potential wealth of data for researchers, 
if major obstacles can be overcome, reports Katharine Gammon. 


n May 1999, the US Food and Drug 

Administration (FDA) approved a prom- 

ising new drug. Earlier clinical trials of 

5,000 patients had convinced regulators 
that the drug was a safe and effective means of 
reducing pain and inflammation in arthritis. 
Produced by the pharmaceutical giant Merck, 
the new medication became an instant hit and 
nearly 80 million people worldwide had taken 
this wonder pill by 2004. 

During that same period, however, evidence 
emerged that the painkiller, known as Vioxx 
(rofecoxib), significantly raised the risk of 
heart attacks. By the time Merck pulled the 
drug from the market in September 2004, 
“an estimated 88,000 to 140,000 excess cases 
of serious coronary heart disease probably 
occurred in the USA’, according to a study 
led by an FDA scientist and published in the 
Lancet in 2005 (ref. 1). The drug may have 
caused tens of thousands of deaths in the United 
States alone, the researchers estimated. 

The digital revolution — and the switch 
from paper to electronic medical records — 
might prevent many of these kinds of deaths 
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in the future. In a 2007 study’, researchers from 
the Children’s Hospital Informatics Program 
(CHIP) in Boston, Massachusetts, attempted 
to see how quickly they could have spotted a 
problem with Vioxx using digital records. By 
homing in on two major hospitals, the research- 
ers found an 18% jump in heart attack cases 
within eight months of Vioxx being introduced, 
which could have provided an 
early warning about the drug, 
says Isaac Kohane, an author of 
the study and director of CHIP. 
The excess cardiac cases disap- 
peared when Vioxx was pulled 
from the market. 

The Vioxx incident shows 
one benefit that could emerge 
when health systems dump 
old-fashioned paper medical charts in favour 
of electronic records. Some countries, such 
as the United Kingdom, have already made 
the transition and the United States is mov- 
ing in that direction. This trend could have 
profound implications for how doctors 
treat patients and how researchers pursue 
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scientific questions that rely on medical data. 

“In the long run, the opportunities are 
huge — actually transformative,’ says Ashish 
Jha, assistant professor of health policy at the 
Harvard School of Public Health in Boston. 
“When the exchange of health information 
flows freely between doctors, we can't imagine 
how huge that’s going to be.” 

But it will take time. The 
move towards electronic 
medical records has proceeded 
slowly so far, and many practical 
challenges still lie ahead. When 
Jha and his colleagues surveyed 
US physicians in late 2007 and 
early 2008, they found that just 
4% have a full electronic records 
system, and only 13% have even 

a basic digital system*. “The organization and 
delivery of health care hasn't changed in nearly 
a hundred years,” says Jha. 

That may soon change. The stimulus package 
that recently passed in US Congress contains 
US$19 billion to promote the switch to dig- 
ital health records, mostly through economic 
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incentives for doctors and hospitals 
to make the transition. The lofty 
goal is to have all health records 
stored and accessed electronically 
by 2014. 

President Barack Obama has 
promoted this policy as a way to 
control soaring health-care costs. 
An estimated $81 billion could 
be saved annually if 90% of doc- 
tors used electronic systems — by 
reducing redundant care, speeding 
up patient treatment and stem- 
ming mistakes and handwriting 
foul-ups. And the switch could cut 
more than just costs. By reducing 
the number of medication errors, 
electronic systems can save lives. 


Digital health records can save lives and money, but security is a concern. 
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Cedars-Sinai hasn't been 
the only institution to face 
a rebellion. In the United 
States, electronic medical 
records spark such fierce 
resistance in part because 
those who benefit most 
from these systems — the 
insurance companies and 
billing department — aren't 
the same as the people who 
have to pay for the new 
software and training. For 
a medium-sized hospital, 
these systems can cost tens 
of millions of dollars just for 
software — with no guaran- 
tee of a return on investment 


Digging for data 

Proponents of electronic medical records say 
that researchers could be among the chief bene- 
ficiaries of the conversion to digital health data. 
The fundamental argument for using electronic 
records in scientific studies is that most doctors 
don't see enough patients to recognize patterns. 
Only on the level of hundreds of thousands or 
millions of records do connections become 
apparent. What’s more, research can be slow 
when it comes to recruiting patients, evaluat- 
ing them and checking in over a long period of 
time, but all that information is available just a 
few clicks into an electronic record. 

But critics argue that current health files 
are too unstructured, too unreliable and that 
algorithms for sifting doctors’ notes are not yet 
available. 

There’s also the question of the type of 
information in the record. “Depending on what 
you want to study, that may or may not be con- 
sistently documented,” says Samuel Skootsky, 
medical director of the Medical Group and 
Faculty Practice Group at the University of 
California, Los Angeles. 

Kohane counters that there is enough 
valuable information in records to justify min- 
ing them. What’s more, the gigantic amount of 
patient data available means researchers can 
choose the individuals to study. For example, 
investigators looking at asthma could pluck 
out true asthmatics from people who wheeze 
owing to other factors, he says. 

For research purposes, he says, observations 
in medical records more accurately reflect 
clinical practice. “If you study a drug and you 
look at its effect on a control group, that’s a very 
pure study,” says Kohane. “But everyone tak- 
ing that drug might be also taking two other 
drugs related to their condition’, which could 
alter the effects of the medication being stud- 
ied. That information is not always available 


to researchers conducting trials, but electronic 
medical records might help sort out the multi- 
drug effects. 

Ina test of that potential, Kohane’s group is 
using health records to look at other drugs on 
the market. The team is searching for effects 
that might have been missed in earlier tests of 
the medications. 

Even more power might emerge by 
combining clinical records with a genetics 
database. Michael Swiernik, 
director of medical informat- 
ics at the University of Cali- 
fornia, Los Angeles, says that 
the combination would allow 
researchers to find the right 
people for a study in a speedy, 
cost-effective way. 

Other research affirms that 
it is possible to dig into electronic records for 
useful clinical data. A study released in Feb- 
ruary showed that data captured in electronic 
records could yield the same results as data 
from clinical trials when evaluating the efficacy 
ofa drug’. The data came from Britain's general 
practice research database, which holds about 
10% of the records of general practitioners — 
mostly relating to demographics and pharma- 
ceutical usage. The problem is that these UK 
databases are small, incomplete and are not yet 
linked together. 

Before researchers can truly tap digital 
records, the databases first have to be created 
and filled. That will not happen if the primary 
users reject them, as happened at the Cedars- 
Sinai Medical Center in Los Angeles in 2002. 
After undergoing months of training with a new 
electronic records and communications system, 
doctors complained that it was too cumber- 
some. One day, the staff revolted, ending the 
hospital’s $34-million effort to convert to an 
electronic system after just three months. 
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“What you want to 
study may or may 


not be consistently 
documented.” 
— Samuel Skootsky 


for the hospital. As nearly 
half the doctors in the United States practise 
either alone or with one partner, the cost is 
extreme: $32,000 per physician for installation 
and $1,500 per month for running the system, 
according to estimates. 

When a hospital or doctor goes out to buy a 
system, they face another challenge: there are 
hundreds of vendors who each want to sell a 
different, proprietary product, many of which 
will not survive. In addition, medical records 
systems often have problems 
sharing data with each other. 
Taken together, these issues 
“make doctors and hospitals 
very nervous to purchase the 
technology’, says Jha. 

That apprehension can 
extend to using the systems as 
well: studies show that doctors’ 
efficiency actually drops 10-20% during the 
first year of using an electronic system. 

Then there’s the thorny issue of privacy. 
Moving electronic patient information around 
between doctors, pharmacies, laboratories and 
billing departments means many people have 
access to sensitive information. That raises the 
risk of a large security breach. Medical-record 
information sells for more on the black mar- 
ket than credit-card information, because it is 
more identifying, says Swiernik. 

“Security is only as good as its weakest link, 
and the likelihood of weak links is really pretty 
high,” says Lee Tien, a lawyer at the Electronic 
Frontier Foundation, a digital-rights public- 
interest group based in San Francisco, Califor- 
nia. He is particularly concerned about data 
security because the transition to electronic 
health records is happening so quickly. “If 
youre not hard-wiring privacy and security 
into the DNA of the infrastructure, there will 
be unfortunate data security breaches.” 

Despite these challenges, some institutions 
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all store patient data imported 
from doctors’ offices or labo- 
ratories, and give control of 
the data to the patient. When 
it comes to scientific research, 
Kohane says personal records 


page to a 


plan has 8.6 million mem- 
bers with full electronic 


Consumers are storing their medical data online with systems such as Google Health. 


could connect patients 
directly with researchers with- 


medical records, and nearly 
half of them use the online interface, where 
they can view part of their records. 

But those systems have yet to benefit 
researchers substantially. Because of privacy 
regulations, Kaiser Permanente, based in 
Oakland, California, limits access to its clini- 
cal records mostly to researchers within the 
organization. The VA has also been slow in 
research. “The VA has had electronic data for 
10 years now, but only a small amount of stuff is 
available for digital clinical research,” says Jha, 
adding that the technology and the level of 
comfort with anonymized data sharing hasn't 
yet caught up to the amount of data available. 

The largest electronic medical-records 
system in the world is Britain’s National Health 
Service (NHS), serving more than 60 million 
people. But the digital data amassed by UK doc- 
tors is not yet comprehensively linked up. That 
will change in the next five years, by which time 
electronic records will connect 30,000 general 
practitioners and include data 
on all UK residents. 

Despite that push into the 
digital age, the opportunity 
for research has been missed, 
says John Powell, associate 
professor in epidemiology and 
public health at the University 
of Warwick, UK. “There’s a 
massive debate going on right 
now about consent — whether 
patients are opting in or opting out ofa research 
system. Doctors for the most part are arguing 
that people have to give consent to opt in, civil 
servants in the Department of Health are saying 
they have to opt out. From the research point 
of view, we want to have maximum people and 
maximum data in.” General practitioners tend 
to be wary of sharing patient data, even if they 
are de-identified. 

That concern grew recently, when some 
officials suggested that the NHS could sell 
data to pharmaceutical companies to raise 
funds — a proposal that prompted strong 
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“If you're not hard- 
wiring privacy 
into the DNA of 


the infrastructure, 

there will be data 

security breaches.” 
— Lee Tien 


criticism. Powell says many of the issues will 
be resolved this year, when the NHS finalizes 
its official constitution — a sort of terms-of- 
service agreement for using the system. He 
argues that the system should be used for 
research. “In return for getting completely free 
health care for everything, there are certain 
things you have to sign up for as a resident 
— and one of those things should be second- 
ary use of your data for the anonymous NHS 
research, but not for pharmaceutical compa- 
nies,’ says Powell. 


Security fears 

Britain has faced major data security prob- 
lems in the past few years, so people are 
understandably concerned about the safety 
of their medical information. But there are 
rumblings of change: the creation in February 
of the National Institute for Health Research, 
a framework agency within the NHS, should 
make it easier for anonymized 
data to move within the sys- 
tem. 

Other European nations 
haven't decided what to do 
with their records, either. In 
Germany, doctors have local 
electronic health records, but 
no way to connect them to a 
larger system. France has a 
national system to hold and 
access electronic medical records, although 
clinicians can view only the part of the record 
that is relevant to them. 

Some researchers see an alternative to large, 
expensive systems in something closer to 
Apple’s iPhone, with its panoply of available 
applications. Kohane says that the best system 
would be a platform available with different 
inter-operable, substitutable programs. 

If open-source platform systems aren't built 
because of the nature of the vendors, Kohane 
sees a future in the expansion of personal health 
records — computerized records maintained 
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out hospitals or physicians as 
intermediaries — even offering the anonymity 
required by most review boards. 

Such systems could fix a major problem 
in medical research. Right now, people who 
participate in studies do so anonymously, and 
the results are not usually transmitted back to 
them. Kohane and his colleagues are testing an 
approach later this year that will allow for more 
communication in one Boston clinic with 5,000 
patients, and they are adding more clinics in 
2010. Patients will sign up at their health-care 
institution and upload clinical or genomic infor- 
mation to an encrypted personal health record 
to be shared with researchers. Then patients 
could choose to ‘tune in’ to a certain type of 
announcement regarding their records. 

If results came out that matched their 
genomic profile, patients would get a message 
about that work. They could also sign up for 
getting news on particular conditions, such 
as cancer or diabetes. And researchers could 
communicate with patients without know- 
ing their identities, allowing an ongoing rela- 
tionship. “The current state of research is of 
self-inflicted mutual ignorance,’ says Kohane. 
“Patients are committing to not knowing more 
about the study, and researchers can’t find out 
more information about the patients.” 

All that could change if health systems can 
get past the substantial hurdles. In a world 
where information is at our fingertips, and 
even the US president has his thumbs on a 
BlackBerry, electronic medical records are 
coming, along with the research opportunities 
they hold. For many scientists, that transition 
is just what the doctor ordered. a 
Katharine Gammon is a freelance writer based 
in Santa Monica, California. 
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OPINION 


CORRESPONDENCE 


These letters respond to the News Feature ‘Closing arguments’ (Nature 457, 650-655; 2009), 
about problems faced by laboratories funded by the US National Institutes of Health. 


Diversity of funding 
sources and topics is 
key to survival 


SIR — Reduced funding and 
increased competition among 

a larger body of active researchers 
also affects countries other than 
the United States, and fields other 
than the biomedical sciences. 

Here in the United Kingdom, 
those of us who use the 
Engineering and Physical 
Sciences Research Council 
(EPSRC) for funding have 
witnessed a similar scenario 
to that described in your News 
Feature: a smaller fraction of 
assessed grants are being funded 
and we have reached the stage 
where there is — for all practical 
purposes — a lottery to determine 
which of the many excellent 
proposals submitted will be 
successful. 

The amount of money that is 
available for curiosity-driven 
research is also decreasing, as 
the government seeks to steer 
funding into areas of political 
priority. The response from the 
scientific community is, of course, 
to become more inventive about 
the potential applications of a 
given proposal. 

| believe that the best way 
to maintain research activity is 
through diversity in both sources 
of funding and topic. You indicate 
that undue weight is given to 
National Institutes of Health 
funding (as opposed to general 
funding) in many US institutions 
as a requirement for tenure or 
promotion. This may further 
concentrate academics in 
attempting to access a finite 
pot, with a consequently 
reduced rate of success. 

However, despite the 
disappointment that we may 
feel in not receiving enough 
funding to maintain our activity or 
research-group size at a desired 
level, we academics should 
recognize our privileged tenured 


position in times of economic 
hardship such as the present. 
Brian Derby School of Materials, 
University of Manchester, Grosvenor 
Street, Manchester M17HS 

e-mail: brian.derby@manchester.ac.uk 


Grant-writing offices 
would let scientists 
get on with research 


SIR — The present financial crisis 
is putting many early-career 
scientists in a complicated 
situation, as noted in your News 
Feature and the Editorial ‘Crisis of 
confidence’ (Nature 457, 635; 
2009). | agree with the Editorial 
that “doctoral programmes should 
build better career counselling 
and training into their curricula 
from the start”, but | also believe 
that universities cannot continue 
to expect professors to do 
research and apply for grants and 
patents, as well as teaching. 
People often choose an 
academic career for the freedom 
and creativity of doing research. 
Overburdening them with 
administrative tasks means that 
their creativity is hardly used. One 
solution might be a ‘grant-writing’ 
office that would know all about 
the availability of grants, how they 
might apply to specific topics and 
what they should contain. 
Cecile Perrault Department of 
Biomedical Engineering, 
McGill University, Montréal, 
Québec H3A 1A4, Canada 
e-mail: cecile.perrault@mcgill.ca 


We need more 
insight into what's 
worth paying for 


SIR — An economic argument can 
be made that growth depends on 
the creation of new markets being 
opened up by new technology. 
Personalized medicine provides 
an example of this, as do advances 


in medical instrumentation such 
as magnetic resonance imaging 
and laser eye surgery. 

Balancing the need for 
increased spending to further 
economic growth with the need to 
spend more effectively during this 
economic downturn means that 
we must improve our ability to 
decide what research is worth 
funding. Simply lowering the 
percentage of funded grants won't 
cause this to happen by itself. 

Effective spending depends on 
detailed peer review to assess 
applications properly. But the 
review quality necessary may be 
hard to realize in practice, because 
there is more and more research 
to review. The traditional rankings 
devised from impact factors don't 
take this kind of work into 
consideration — rather, they 
promote more writing, creating 
more work for reviewers. A better 
system would properly credit 
people for their review and 
assessment work so that money 
could be spent more effectively. 
William Gunn Genalyte Inc, 

11760 Sorrento Valley Road, 
San Diego, California 92121, USA 
e-mail: william.gunn@gmail.com 


Backlogged system 
in Australia shuts out 
new investigators 


SIR — The US situation may be 
worse than it once was. But it’s 
still a big step ahead of Australia. 
Total health-related R&D 
spending as a percentage of 
gross domestic product is roughly 
half that of the United States or 
the United Kingdom, according 
to the Organisation of Economic 
Co-operation and Development 
(Statistics and Indicators for 30 
Countries OECD, 2007). Our 
biomedical fellowship scheme is 
backlogged to the point that the 
awards designed to support new 
principal investigators go mostly 
to well-established group leaders. 
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A recent survey of the medical 
research workforce in Australia 
highlights the level of anxiety 
and discontent over poor career- 
development opportunities and 
lack of funding (M. Kavallaris et al. 
Med. J. Australia 188, 520-524; 
2008). In the preceding five years, 
for example, 6% of respondents 
had already left active research; 
73% were considering leaving. 
This presents a clear challenge for 
recruitment and retention of a 
highly skilled workforce. 

To keep Australian science at 
the leading edge, we need to 
maintain the flow of fresh ideas by 
ensuring opportunities for both 
new and established investigators. 
The new US administration has 
recognized the loss of career- 
development opportunities 
caused by underfunding. Australia 
runs the risk of failing to capitalize 
on significant government 
investments made so far. The 
recent announcement of a federal 
fellowship scheme for mid-career 
researchers hints at a potential 
improvement. But the near-term 
prospects for aspiring new 
investigators and those returning 
from overseas are not promising. 
Darren Saunders Department of 
Pathology & Laboratory Medicine, 
University of British Columbia; and 
Molecular Oncology & Breast Cancer 
Program, BC Cancer Research Centre, 
Vancouver, BC V5Z1L3, Canada 
e-mail: dsaunders@bccrc.ca 


In February 2009, the director of the 
US National Institute of Neurological 
Disorders and Stroke moved the 
institute's funding cut-off for grant 
applications from the 10th to the 11th 
percentile after a re-evaluation of 
financial obligations. The change 
meant that Darcy Kelley (quoted in 
this News Feature) and another seven 
investigators whose applications 
had fallen below the cut-off in recent 
funding rounds were funded — 
Editor, Nature. 


Further discussion is welcome at 
http://tinyurl.com/cf59fq. 
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Do nations go to war over water? 


Wendy Barnaby was asked to write a book about water wars — then the facts got in the way of her story. 


The United Nations warned as recently as last 
week that climate change harbours the poten- 
tial for serious conflicts over water. In its World 
Water Development Report’ of March 2009, it 
quotes UN Secretary-General Ban Ki-moon 
noting the risk of water scarcity “transforming 
peaceful competition into violence” It is state- 
ments such as this that gave birth to popular 
notions of ‘water wars: It is time we dispelled 
this myth. Countries do not go to war over 
water, they solve their water shortages through 
trade and international agreements. 

Cooperation, in fact, is the dominant 
response to shared water resources. There are 
263 cross-boundary waterways in the world. 
Between 1948 and 1999, cooperation over 
water, including the signing of treaties, far out- 
weighed conflict over water and violent conflict 
in particular. Of 1,831 instances of interactions 
over international freshwater 


Allan had made the same assumption a few 
decades earlier when he set out to study the 
water situation in Libya. By the mid-1980s, 
water stress in North Africa and the Middle 
East had worsened; but Allan began to ques- 
tion his assumptions when he found no sign 
of the widely predicted water wars. Instead, the 
burgeoning populations of the Middle East- 
ern economies had no apparent difficulties 
in meeting their food and water needs. Allan 
had been forced to grapple with a situation in 
which people who are short of water do not 
necessarily fight over it. 


Invisible water 

Allan’s earlier thinking about water wars began 

to change after meeting the late Gideon Fishel- 

son, an agricultural economist at Tel Aviv Uni- 

versity, Israel. Fishelson argued that it is foolish 
for Israel, a water-short country, 


resources tallied over that time “| five decades, to grow and then export prod- 
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from unofficial verbal exchanges cados, which require a lot of 
to economic agreements or mili- NO formal water to cultivate. Fishelson’s 


tary action), 67% were coopera- 
tive, only 28% were conflictive, 
and the remaining 5% were neu- 
tral or insignificant. In those five 
decades, there were no formal declarations of 
war over water’. 

I learned this the hard way. A few years ago, 
[had just written a book about biological war- 
fare’ and the publishers were keen for me to 
write another. “How about one on water wars?” 
they asked. It seemed a good idea. The 1990s 
had seen cataclysmic forecasts, such as former 
World Bank vice-president Ismail Serageldin’s 
often-quoted 1995 prophecy that, although 
“the wars of this century were fought over 
oil, the wars of the next century will be fought 
over water”. 

This and similar warnings entered the zeit- 
geist. Tony Allan, a social scientist at King’s 
College London and the School of Oriental 
and African Studies (SOAS) in London, sum- 
marized’ the not-so-subtle argument as “if you 
run out of water you reach for a Kalashnikov or 
summon the air strike”. 

Thad no difficulty finding sources to back up 
this argument, and I set about writing chapters 
on the Jordan, the Nile and the Tigris—Euphra- 
tes river systems. My chapter choice relied on 
what seemed a perfectly reasonable assump- 
tion: that water scarcity was governed by the 
presence or absence of flowing water. 
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declarations of war 
over water.” 


work prompted Allan to realize 
that water ‘embedded’ in traded 
products could be important in 
explaining the absence of con- 
flict over water in the region. 

As a global average, people typically drink 
one cubic metre of water each per year, and 
use 100 cubic metres per year for washing and 
cleaning. Each of us also accounts for 1,000 
cubic metres per year to grow the food we eat. 
In temperate climates, the water needed to pro- 
duce this food is generally taken for granted. 
In arid regions, Allan described how people 
depend on irrigation and imported food to 
fulfill these needs. Imported food, in particluar, 
saves on the water required to cultivate crops. 

The relationship of food trade to water 
sustainability is often not obvious, and often 
remains invisible: no political leader will gain 
any popularity by acknowledging that their 
country makes up the water budget only by 
importing food. Allan saw through this to 
document how the water budgets of the Middle 
East were accounted for without conflict. 

Allan wrote about embedded water for a few 
years without it exciting any comment. Then, 
on a dark Monday afternoon in November 
1992, during a routine SOAS seminar, some- 
body used the term ‘virtual’ water to describe 
the same concept. Allan realized this attention- 
grabbing word, in vogue with the computer- 


© 2009 Macmillan Publishers Limited. All rights reserved 


literate younger generation, would catch on 
better than his own term. And he was right: 
“From there on it flew,’ he says. 

Allan’s work explained how, as poor coun- 
tries diversify their economies, they turn away 
from agriculture and create wealth from indus- 
tries that use less water. As a country becomes 
richer, it may require more water overall to sus- 
tain its booming population, but it can afford 
to import food to make up the shortfall’. 

Areas seemingly desperate for water arrive 
at sustainable solutions thanks to the import 
of food, reducing the demand for water and 
giving an invisible boost to domestic supplies. 
Political leaders can threaten hostile action if 
their visible water supplies are threatened (a 
potentially useful political bluff), while not 
needing to wage war thanks to the benefits 
of trade. 


Sources of war 

Israel ran out of water in the 1950s: it has not 
since then produced enough water to meet all 
of its needs, including food production. Jor- 
dan has been in the same situation since the 
1960s; Egypt since the 1970s. Although it is 
true that these countries have fought wars with 
each other, they have not fought over water. 
Instead they all import grain. As Allan points 
out, more ‘virtual’ water flows into the Middle 
East each year embedded in grain than flows 
down the Nile to Egyptian farmers. 

Perhaps the most often quoted example of 
a water war is the situation in the West Bank 
between Palestinians and Israel. But as Mark 
Zeitoun, senior lecturer in development studies 
at the University of East Anglia in Norwich, UK, 
has explained, contrary to what both the mass 
media and some academic literature say on the 
subject, while there is conflict and tension — as 
well as cooperation — there is no ‘water war’ 
here either’. 

Ten million people now live between the Jor- 
dan River and the Mediterranean Sea. If they 
were to be self-sufficient in food, they would 
need ten billion cubic metres of water per year. 
As it is, they have only about one-third of that: 
enough to grow 15-20% of their food. They 
import the rest in the form of food. When it 
comes to water for domestic and industrial use, 
the rainfall and geology of the West Bank alone 
should provide enough water for the popula- 
tion there: Ramallah has a higher annual aver- 
age rainfall than Berlin. But today, water for 
even these needs is scarce. 
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Power struggles and politics have led to overt 
and institutionalized conflict over water — but 
no armed conflict, as there is over borders and 
statehood. Instead, Palestinian and Israeli water 
professionals interact on a Joint Water Commit- 
tee, established by the Oslo-II Accords in 1995. 
It is not an equal partnership: Israel has de facto 
veto power on the committee. But they con- 
tinue to meet, and issue official expressions of 
cooperation, even in the face of military action. 
Inequitable access to water resources is a result 
of the broader conflict and power dynamics: it 
does not itself cause war. 

The Nile Basin Initiative, launched in 1999 
and encompassing nine nations, is another 
example of the way in which wider geopoliti- 
cal and economic factors help to balance water 
allocation. Historically, vast differences in the 
political clout of nations across which, or along 
which, a river flows have resulted in unequal 
water division. Under the 1959 Nile Waters 
Agreement between Egypt and Sudan, Egypt 
has had rights to 87% of the Nile’s water, with 
Sudan having rights to the rest. Ethiopia, whose 
highlands supply 86% of Nile water, does not 
even figure in the agreement: continuing con- 
flicts weakened the agreement to a point where 
Ethiopia has been unable to press a claim. But 
Egypt’s desire to consolidate its economic 
development necessitates that it now come to 
better terms with its neighbours, improving 
prospects for local trade. So Egypt is willing to 


engage in the multilateral initiative to 
cooperate more on matters such as hydroelec- 
tric power development, power-sharing coop- 
eratives, river regulation and water-resources 
management. 

Likewise, although India and Pakistan have 
fought three wars and frequently find them- 
selves in eyeball-to-eyeball confrontation, the 
1960 Indus Waters Treaty, arbitrated by the 
World Bank, has more than once helped to 
defuse tensions over water. 


Oil and water don't mix 

Yet the myth of water wars persists. Climate 
change, we are told, will cause water short- 
ages. The Intergovernmental Panel on Climate 
Change estimates that up to 2 billion people 
may be at risk from increasing water stress by 
the 2050s, and that this number could rise to 
3.2 billion by the 2080s’. 

Water management will need to adapt. 
But the mechanisms of trade, international 
agreements and economic development that 
currently ease water shortages will persist. 
Researchers, such as Aaron Wolf at Oregon 
State University, Corvallis, and Nils Petter 
Gleditsch at the International Peace Research 
Institute in Oslo, point out that predictions of 
armed conflict come from the media and from 
popular, non-peer-reviewed work. 

There is something other than water for 
which shortages, or even the perceived threat 
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of future shortages, does 
cause war — oil. But the 
strategic significance 
of oil is immeasurably 
higher than that of water. 
Serious interruptions of 
oil supplies would stop 
highly developed econo- 
mies in their tracks. Oil is 
necessary for a developed 
economy, and a devel- 
oped economy provides 
for all the needs of its 
citizens, including water. 
People in developed 
economies do not die 
of thirst. 

My encounter with 
Allan’s work killed my 
book. I offered to revise its 
thesis, but my publishers 
pointed out that predict- 
ing an absence of war over 
water would not sell. 

Book or no book, it is 
_- still important that the 

popular myth of water 
wars somehow be dispelled 
once and for all. This will not 
only stop unsettling and incorrect predictions 
of international conflict over water. It will also 
discourage a certain public resignation that cli- 
mate change will bring war, and focus attention 
instead on what politicians can do to avoid it: 
most importantly, improve the conditions of 
trade for developing countries to strengthen 
their economies. And it would help to convince 
water engineers and managers, who still tend 
to see water shortages in terms of local supply 
and demand, that the solutions to water scar- 
city and security lie outside the water sector in 
the water/food/trade/economic development 
nexus. It would be great if we could unclog 
our stream of thought about the misleading 
notions of ‘water wars. a 
Wendy Barnaby is editor of People & Science, 
the magazine published by the British Science 
Association. 
e-mail: w.barnaby@btinternet.com 
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When wise words are not enough 


Thomas Homer-Dixon argues that opinion-makers must demonstrate a better grasp of how societies rise 
and fall if they are to steer nations successfully through many of this century's major crises. 


These are bewildering times. One moment 
the global economy is booming and stock 
markets are soaring; the next, trillions of dol- 
lars of wealth have vanished, and we are on the 
cusp of a global depression. Oil prices rocket 
upwards as leading oilmen talk of worldwide 
shortages, then they plummet amid a world- 
wide glut. The United States seems to be in 
terminal decline, written off by pundits as a 
has-been hegemon, only to receive a new lease 
of life, many perceive, with the election of an 
extraordinary leader. 

In a world reeling from surprise, where 
once-in-a-lifetime events seem to happen 
every month, two things seem to be con- 
stant. The first is the inadequacy of expertise. 
Although the people we have anointed as 
experts might not admit it, they are as bewil- 
dered by the world’s turbulence as the rest of 
us. They are also little better at predicting what 
is going to happen next. The second constant 
is a pervasive feeling of insecurity. The things 
we assume to be bedrock truths around which 
we can organize our lives — scientific theory, 
moral precepts, political institutions or perhaps 
the timeless rhythms of nature — seem to be 
increasingly under assault. 

In two books that offer erudite assessments 
of the dangers facing humankind this century, 
Vaclav Smil and Chris Patten address these 
matters in sharply different ways. In Global 
Catastrophes and Trends, Smil, a Canadian 
scientist of prodigious 
productivity and 
extraordinary disci- 
plinary breadth, basi- 
cally says “get used to 
it”. Many of the vital 
natural and social 


The Next Fifty Years 
by Vaclav Smil 


systems aroundusare Twenty-First Century 
so complex that deep by Chris Patten 
uncertainty character- Allen Lane: 2008. 496 pp. £25 


izes their behaviour, 

and predicting this behaviour is near impos- 
sible. Thankfully, many of the threats to our 
wellbeing highlighted by the media are exag- 
gerated — often wildly so. Although there are 
reasons for concern about where humankind 
is going, we need to remember that insecurity 
is part of the human condition. Catastrophe is 
too, but it is less likely than we imagine. Over- 
all, given the admirable human capacity to 
adapt and change, the human prospect is far 
brighter than many assume. 
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Global Catastrophes and Trends: 


MIT Press: 2008. 320 pp. $29.95, £19.95 


What Next? Surviving the 


The biggest threats to our existence are likely to be pandemics and wars, not asteroid hits or terrorism. 


Patten, the last governor of Hong Kong, a 
cabinet minister in Margaret Thatcher's UK gov- 
ernment and currently the co-chair of the Inter- 
national Crisis Group, a conflict-analysis agency, 
is less sanguine about the formidable challenges 
we face — including climate change, resource 
depletion, global criminality, widening gaps 
in wealth and possible nuclear terrorism. But 
in his book What Next?, he shares Smil’s opti- 
mism about humankind’s capacity to cope. With 
scientific practices and the reasoning brain that 
evolution has given 
us, we can understand 
the basic contours of 
our challenges, if not 
predict exactly how 
they will shape our 
future. With freedom, 
democracy, the rule of 
law and strong, com- 
petent states — all 
legacies of the brutal 
process of past European state building — we 
have the social and institutional architecture 
needed to find and implement solutions. These 
powerful assets allow us to choose between 
better and worse futures. The open question is 
whether we have the will to choose correctly. 

Both books are grounded in the mastery 
of a staggering amount of knowledge — in 
Smil’s case, largely technical and scientific; 
in Patten’s, more social and political. Each 
is steeped in the wisdom that comes from 
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decades of experience and reflection. And 
each weaves argument and evidence around 
a clear structure of analysis. 

Yet both books ultimately disappoint, for 
reasons that say less about the books them- 
selves than about the largely unrecognized 
gravity of humankind’s current predicament. 

Smil’s analysis is guided by a clear time hori- 
zon and a simple distinction. What, he asks, 
are the factors that could plausibly and fun- 
damentally affect the human condition in the 
next five decades? He distinguishes between 
catastrophes and trends — between “fatal dis- 
continuities” that could cause a sudden turn 
for the worse in the human prospect, and 
long-term incremental change that is barely 
noticed year-on-year but that could, over time, 
alter deep characteristics of our world. 

Smil’s discontinuities include large asteroid 
impacts, mega-eruptions of volcanoes, influ- 
enza pandemics, transformational wars and 
terrorist attacks. After assessing the best evi- 
dence available, he concludes that the risks of 
pandemics and major wars are large enough 
— approaching 100% for a war that will kill 
five million people sometime in the next 
50 years — to justify significant investment in 
prevention. We should worry much less about 
asteroids and volcanoes, and current evidence 
does not justify our preoccupation with terror- 
ism. He is indefensibly fatalistic about nuclear 
terrorism. Prevention, he says, cannot thwart 
all attacks, deterrence does not work against 
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zealots and we cannot effectively prepare for an 
attack in advance. But in reality, much can be 
done to prevent a nuclear strike in a major city; 
most importantly, nations can work harder to 
secure and ultimately destroy the world’s stock- 
piles of highly enriched uranium. 

As for unfolding trends, Smil includes in his 
list humankind’s transition away from fossil 
fuels as a main energy source, shifting power 
relations among major nations, widening eco- 
nomic inequalities, climate change, perturba- 
tions of the global nitrogen cycle and antibiotic 
resistance. These trends do not sit together 
easily — for example, some implicate Earth’s 
natural systems whereas others are mostly the 
product of human social and economic interac- 
tions — so Smil’s analysis often comes across as 
arather indigestible series of apples and oranges. 
Within the same chapter, for instance, he jumps 
from a treatment of the intrinsic power-density 
limits of renewable energy to an assessment of 
the geopolitics of world order. 

Of greater concern is Smil’s tendency to slip 
into polemic. His discussions of constraints 
on conventional oil supply and of climate 
change are selective and sometimes simply 
wrong. For example, he dismisses the possi- 
bility of abrupt climate shifts, based on a mud- 
dled interpretation of the science of millennial 
climate events recorded in ice cores (known as 
Dansgaard-Oeschger oscillations); he down- 
plays recent research that strongly suggests 
sea levels will rise much faster than projected 
by the Intergovernmental Panel on Climate 
Change; and he wrongly suggests that the Gulf 
Stream is not driven partly by thermohaline 
circulation and that it does not contribute to 
Europe's warmth. He also repeats, as an illus- 
tration of how past predictions can be wildly 
wrong, the myth that scientists largely sup- 
ported the early 1970s theory that global cool- 
ing was imminent. 

Patten cannot claim Smil's technical expertise, 
but his book is more balanced. He acknowledges 
his biases and the limits of his knowledge, and 
gives fair consideration to contrary views. Some 
of his passages, especially those on China and 
India, are wonderfully rich. And his prescrip- 
tions are simultaneously deeply humane and 
pragmatically bounded by a long familiarity 
with the ways of the world — no mean feat. 

Yet too often What Next? seems laboriously 
descriptive, absorbed in details of how we got 
to where we are, rather than focusing on what 
might happen later this century. The book’s 
time horizon is only a few decades into the 
future, so its subtitle is misleading. And one 
is left with the disquieting sense that the state- 
based apparatus that Patten calls on to save 
humankind is not remotely up to the task. 
Patten is right to admire the power and 


potential of Western institutions, but the 
challenges we now face, such as climate change, 
are decidedly different from those that such 
institutions evolved to address. 

This brings us to why these books ultimately 
disappoint: in each case the analysis, although 
clearly structured, is not guided by an underly- 
ing theory of societal crisis. True, such a theory 
would not give the authors the power to predict 
the future with any precision, but it would help 
them to identify which combinations of factors 
might lead humankind over the precipice, and 
what we might do to avoid such a fate. 

Recent research has highlighted fundamen- 
tal causes of societal crisis. These include the 
convergence of interacting stresses that over- 
loads the coping capacity of multiple com- 
ponents of society, especially the state; rising 
social and technical complexity that generates 
diminishing marginal societal returns; the 


rapidly escalating cost of getting energy; high 
connectivity among people, institutions and 
technologies; and the declining redundancy 
and resilience of crucial systems. 

One can make a credible argument, 
grounded in just the kind of evidence Smil 
and Patten use, that all of the above factors are 
now acting powerfully in humankind’s tightly 
coupled, planetary, socio-ecological system. In 
the absence ofa serviceable theory of societal 
crisis, in the end both books present little more 
than a laundry list of things we should worry 
about. For our children’s and grandchildren’s 
sake, we need much more, and fast. r | 
Thomas Homer-Dixon is at the Balsillie School 
of International Affairs, Waterloo, Ontario 
N2L 6C2, Canada, and author of The Upside of 
Down: Catastrophe, Creativity and the Renewal of 
Civilisation. 
e-mail: tad@homerdixon.com 


Universe In a box 


Joseph Cornell and Astronomy: A Case 
for the Stars 

by Kirsten Hoving 

Princeton University Press: 2008. 

336 pp. $49.50, £35 


Sculptor Joseph Cornell made boxes. Intricate, 
three-dimensional montages of photographs, 
sky maps and paintings, neatly packed into 
wooden cases full of bric-a-brac: eggs, pipes, 
glasses, shells, stamps, chains and whatever 
objects fitted with his internal logic. And, even 
though he was considered one of the American 
exponents of surrealism, - 
that epitome of irration- 
ality, there was a logic. 
In a mesmerizing, if 
mildly flawed, attempt to 
immerse us in Cornell’s 
crafted universes, art 
historian Kirsten Hoving 
uses the artist’s fascina- 
tion with astronomy to 
tease out the logic that 
underlies his work. 
Cornell’s life was com- 
plicated. Although he was 
from an affluent back- 
ground, he had to sup- 
port his younger brother 
who suffered from cere- 
bral palsy, and until 
the late 1940s worked 
variously as a salesman, 
textile designer and in 
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Joseph Cornell's work mirrors the 1960s 
US cultural fascination with astronomy. 


a plant nursery. He was a staunch believer in 
the teachings of Mary Baker Eddy, the founder 
of Christian Science, and never forged a long- 
lasting relationship with a companion, spend- 
ing most of his life in the same house in Utopia 
Parkway, a working-class neighbourhood of 
Flushing in Queens, New York. Yet, at the same 
time, his boxes and experimental films were 
admired in the high-octane art scene of New 
York, where he took part in the first surrealist 
exhibition at the Museum of Modern Art. Mar- 
cel Duchamp introduced him into the orbit of 
Peggy Guggenheim and her coterie, and from 
the early 1950s onwards, he was able to make a 
living through his art. 
Hoving does something 
ambitious and difficult: she 
identifies one important 
thread of his creative pro- 
cess and uses it to help us 
understand Cornell’s art. 
Astronomy clearly played an 
important part in his work 
and life. His boxes, films and 
the countless dossiers that 
he stored at his house were 
littered with star maps, ref- 
erences to Albert Einstein 
and Arthur Stanley Edding- 
ton, solar eclipses and his 
fascination with space 
travel. Hoving uses these 
obsessions to relate a body 
of work that spans many 
decades. Albeit an effective 
choice, astronomy is only 
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one thread of many, and it competes against 
his fascination with the natural sciences, art, 
poetry, modern dance and popular culture. Is 
one strand enough to understand him by? 
The book’s relentless focus on astronomy 
is to the detriment of a more comprehensive 
view of Cornell as an artist and as a man. Very 
little is said about his personal life, with only 
fleeting references to his reclusiveness, his close 
relationship with his brother and mother and 
his romantic trysts in later life. To compensate, 
Hoving extends her prose, clogging up the flow 
of the narrative. She feels obliged to describe 
the minutiae of individual artworks, to pep- 
per the text with vignettes on cosmology and 
astronomy, and to make the case for a few of 


the more tenuous links between his work and 
astronomy. To be fair, Hoving’s arguments are 
on the whole convincing and at times enthral- 
ling, and her access to the debris of Cornell's life 
puts her in a unique position to back up many of 
her claims. But her thesis could have been made 
in half the number of pages. This is certainly a 
case in which less would have been more. 

Yet, with its high-quality production and 
beautiful and wide-ranging illustrations, 
the book is extremely absorbing. I see it as 
Hoving’s attempt to construct her own box, 
a carefully crafted piece laden with excerpts 
and images that penetrate Cornell’s world 
and his obsessions. Furthermore, it is a show- 
case for the enthusiasm with the modern that 


pervaded American popular culture during 
the twentieth century. By including news- 
paper articles and advertisements of the time 
in which astronomy and the space race were 
prominent, Hoving shows that Cornell was 
mirroring what he saw around him. 

Ata time when the interplay of science and art 
is ever more present in cultural life, and we begin 
to ask ourselves whether much of what is done is 
any good, Joseph Cornell and Astronomy gives us 
a good example of what works best: quiet fasci- 
nation and obsession allied with genius. a 
Pedro Ferreira is a professor in astrophysics at the 
University of Oxford, Keble Road, Oxford OX13RH, 
UK, and author of The State of the Universe. 
e-mail: p.ferreiral@physics.ox.ac.uk 


A romp through science fiction 


Nanovision: Engineering the Future 
by Colin Milburn 

Duke University Press: 2008. 

296 pp. $22.95, £17.99 


Nanotechnology, we are often told, will change 
every aspect of our lives. It is easy to hype 
such claims because of the long timescales 
for approving a new technology for commer- 
cialization. The science itself can sound like 
science fiction at times, and fears about how 
nanotechnology will be realized in the future 
crop up in works of fiction. In Nanovision, 
historian of science Colin Milburn proposes 
that such speculation is an integral part of 
nanotechnology. 

Milburn examines nanotechnology in a 
wealth of science-fiction texts, investigating 
such themes as the manipulation of matter, 
artificial life forms and the spread of nano- 
bots as a ‘grey goo’ that ultimately consumes 
Earth. Many writers portray the science ina 
negative light, as the enabler of humankind’s 
destruction or enslavement. Other imagina- 
tive works simply exploit differences between 
the physical laws that rule the nanoworld and 
the macroworld we live in, such as the effect 
of surface tension on a microscopic fictional 
organism. Milburn also looks at individuals 
who have straddled the fields of science and 
sci-fi, including molecular engineer and futu- 
rologist Eric Drexler, and carbon-nanotube 
discoverer Richard Smalley, who envisioned 
an elevator made of nanotubes stretching from 
Earth into space. 

The book fills gaps that most nanoscientists 
have in their knowledge about the history of 
a field that has evolved so fast as to effectively 
obscure its founders. However, Milburn tries 
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too hard to prove the mix of science and fiction. 
In focusing on nanotechnology, he neglects 
the fact that forward thinking, technology 
forecasting and hyped claims are common to 
many scientific disciplines, such as genetics, 
energy research and space flight. More wor- 
ryingly, he confuses theoretical science and 
fiction, missing the point that theories are 
not just speculation but are underpinned by 
mathematics, physical laws and reproducible 
calculations. His analysis also takes too liter- 
ally the timetables and roadmaps contained in 
grant proposals and the like; documents that 
scientists consider to be indicative rather than 
prescriptive. 

Some parts of the book, even the introduc- 
tion, are difficult to follow. The writing is 
shrouded in the complex language of post- 
modern literary theory, and is often as dense as 
a hard-core physics paper. Milburn’s references 
to gendered themes of male domination, even 
when discussing scientific instruments such 
as the scanning tunnelling microscope, are 


sometimes involuntarily comical. Yet he 
ignores the large number of women in nano- 
science and their views: no female nanoscien- 
tists are cited in the book. 

Nanovision is not the place to seek clues 
about where nanotechnology is going, as it 
contains few scientific developments past the 
start of this decade. Beyond the hype, many 
useful applications — ranging from self-clean- 
ing surfaces to sunscreen lotion containing 
nanoparticles of titanium dioxide, to nano- 
composite materials in the car industry — are 
already on the market. Furthermore, in con- 
trast to fears that the world will be devoured 
by grey goo, studies have shown that public 
perceptions of nanotechnology are mostly 
positive. Although there are real and tangible 
risks regarding the toxicity of nanomaterials, 
the scaremongering that fascinates Milburn 
is not such a pressing concern. But if you 
like a romp through themes that mix current 
nanoscience and literature in interesting ways, 
Milburn’s book is a valuable read. Perhaps 
reality is stranger than fiction. a 
Stefano Tonzani is materials science editor at 
Nature. 


Nanotechnology is often portrayed in fiction as destructive, but public perceptions are more positive. 
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Disturbing the Arkansas river || 


Over The River, A Work in Progress 
Fondation de |'Hermitage, Lausanne, 
Switzerland 

Until 24 May 2009. 


The latest idea of the artists famous for wrapping 
Paris's Pont Neuf and Berlin's Reichstag in plas- 
tic is to cover parts of a 64-kilometre stretch of 
the Arkansas River in Colorado in aluminium- 
coated polypropylene. Christo sketched his first 
preparatory drawing for Over the River in 1992, 
but it won't come into being until the summer 
of 2012 at the earliest; hence the addendum to 
the title of this travelling exhibition, showing in 
Lausanne, Switzerland, until 24 May. 

Although Christo does the drawings, he and 
his wife Jeanne-Claude are full artistic partners 
in every one of their “gentle disturbances”. This 
is how they refer to their projects, which tem- 
porarily alter the relationship between sky and 
earth. Over the River will be in place in Colo- 
rado for just two weeks, after which it will be 
dismantled and recycled. All that will remain 
will be a few steel anchorage points, buried deep 
in the rock beneath the artificially raised banks 
that support the railway on the north side of the 
river and US Route 50 on the south. 

The canopy, consisting of around 900 panels 
and nearly 84,000 square metres of fabric, will 


be broken up in places to make way 
for bridges, trees, rocks and other 
aesthetic interruptions. It will be 
translucent, so that people floating 
beneath it in rafts, canoes and kay- 
aks can look up and see how the sky 
is changing. And when the wind 
blows, the artists say, it will undu- 
late and shimmer “like a second, aerial river”. 

Christo and Jeanne-Claude chose the Arkan- 
sas River in 1996, having spent four years, on and 
off, reconnoitring in the Rocky Mountains. Full- 
sized and wind-tunnel tests followed, as well as 
an assessment of how the project will affect the 
environment — nothing like it has been done 
before. How will it withstand the temperatures 
in excess of 35°C that are regularly recorded in 
Colorado in July? How will the mountain fauna 
respond to this disturbance? 

A selection of Christo’s preparatory drawings, 
as well as collages (pictured) of maps, photo- 
graphs, pencil sketches and mock-up fabric pan- 
els, form the substance of the exhibition, along 
with black-and-white photos of people looking 
thoughtful on river banks. From these, Jeanne- 
Claude stands out with her frizzy hair and enor- 
mous hat, reminiscent of a beekeeper’s. 

As impressive as the project itself are the 
hoops the pair have jumped through to try to 
get permission to bring it off. This is chronicled 
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in a film diary that Wolfram Hissen started § 
in 1994. The artists patiently put their case to 9 
bureaucrats and local residents who mostly 2 
seem to want to understand and to help; yet in S 
an echo of creationists’ objections to Charles 
Darwin's theories, some voice their anger at 
what they see as a couple of blasphemous ego- 
maniacs bent on improving “God’s work”. 

In the film, Christo explains that, having lived 
in Bulgaria when it was a Communist state, he 
has an urge to create art that has no ulterior 
motive. He and his wife have therefore devoted 
their lives to championing expensive, time- 
consuming and ephemeral projects. It may be 
their last — they are both more than 70 years 
old — and it may never happen. The locals will 
have the last word, and Hissen’s film leaves you 
in doubt as to what that word will be. If Over 
the River ever sees the light of day, it will have 
emerged from its own chrysalis — an even big- 
ger wrapping made entirely of red tape. a 
Laura Spinney is author of the novel The Quick. 


Not one to watch 


Watchmen 
Film directed by Zack Snyder 
In UK and US cinemas now. 


Owing to complicated, interlocking storylines 
coupled with a gruesome take on humanity, 
Alan Moore's comic-book novel Watchmen was 
long considered to be unfilmable. Director Zack 
Snyder has at last made a film version. But the 
title should have stayed on the printed page. 

Watchmen depicts the rise and fall of superhe- 
roes in an alternative 1985 in which the United 
States won the war in Vietnam, Richard Nixon 
is in permanent rule, the world is careening 
towards nuclear war, anda terrible lab accident 
created Doctor Manhattan, a super-being capa- 
ble of manipulating the quantum universe. 

The depth of character and story development 
in the 12-issue series, published by DC Comics 
in 1986-87, transformed the genre. The books 
were brutal, dystopian and had a deeply pessi- 
mistic view of the human condition. Watchmen 


Quantum morality: Dr Manhattan in Watchmen. 


grappled with intractable moral issues, many 
arising from cold-war apocalyptic nightmares. 
It asked if humanity would destroy itself, if that 
self-destruction could be prevented, and at what 
cost — and it questioned the role of science in 
preservation versus destruction. 

Some elements of Watchmen’s failure as a film 
were predestined. It sinks beneath the weight of 
backstory — necessary for the uninitiated but 
yawningly dull. The most colossal blunder is the 
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casting of Matthew Goode as the smartest man 
in the world, Adrian Veidt, also known as the 
superhero Ozymandias. Veidt, who propels the 
story to its awful conclusion, should be magnetic 
but instead comes across as repulsively foppish. 
It is impossible to accept him as the executor of 
the most diabolical practical joke ever. 

The end of the film fails, especially in terms 
of science ethics. In the book, saving the world 
from nuclear holocaust is accomplished through 
abhorrent yet inspired acts of technological and 
artistic creation. The result shocks — a Machia- 
vellian decision with cataclysmic consequences, 
leaving the reader to dine on a stew of ambigu- 
ity. Was this murderous apotheosis of science 
ultimately required for the preservation of 
humanity? What is the cost to society of Veidt’s 
salvation plan, or even of modern efforts such 
as the International Space Station? In the film, 
the creative act is replaced by one of theft and 
dissimulation, and the choice for the viewer is 
stripped of subtlety. 

The bookand film are summarized by asking 
“Who watches the Watchmen?” For the film, the 
answer should be, not many. Buy the book. m 
Michael White is an associate editor at Nature. 
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Handedness in quick time 


Patrick H. Vaccaro 


The handedness of chiral molecules can be probed spectroscopically, but acquiring data can take hours, 
which is a problem for time-resolved studies. The latest method records such data ina flash. 


Most daily encounters with chirality and chiral 
recognition go unnoticed, including such 
mundane events as putting on shoes, shaking 
hands with someone and admiring the helical 
patterns of seashells. But the concept that 
certain objects and interactions have an intrin- 
sic ‘handedness’ permeates the entire fabric 
of science, and has a crucial role in diverse 
physical, chemical and biological processes. Of 
special importance for the molecular sciences 
are the different chemical and biochemical 
reactivities displayed by enantiomeric (mirror- 
image) compounds. Indeed, many drugs are 
single enantiomers; their mirror-image ver- 
sions are often ineffective, or even harmful. 

The development of tools to discriminate 
between enantiomers and to work out how 
chiral molecules interact is therefore an 
ongoing challenge. In particular, the unique 
signatures that can be obtained using optical 
probes that are based on chiral — circularly 
polarized — light (Fig. 1a) represent a long- 
standing, yet constantly evolving avenue of 
research’. On page 310 of this issue, Rhee et al. 
demonstrate a significant advance that prom- 
ises to open new vistas in the realm of chirop- 
tical spectroscopy, including the tantalizing 
possibility of interrogating chirospecific phe- 
nomena with femtosecond temporal resolution 
(1 femtosecond is 10°’° seconds). 

When electromagnetic radiation passes 
through a randomly oriented (isotropic) 
ensemble of non-chiral molecules — such 
as those in a gas, liquid or glass — its state of 
polarization, defined by the direction of oscil- 
lation of the electric-field vector, remains 
unaltered. This implies that the sample’s fre- 
quency-dependent refractive index, n, and 
absorption index, «, are independent of opti- 
cal polarization. These parameters govern 
the speed and intensity of a wave propagating 
through the medium. 

In contrast, light traversing a chiral medium 
experiences an intrinsic helical anisotropy, 
commonly referred to as optical activity, 
which causes right-circularly (R) and left- 
circularly (L) polarized light to undergo dis- 
tinct retardation (n, # nz) and attenuation 
(Kk, # Kp) processes’. The differential retarda- 
tion (An =n,- ng) leads to the phenomenon 


Figure 1 | Optical polarization and optical activity. a, The oscillating electric-field vector of linearly 
polarized electromagnetic radiation (blue) can be broken down into equal contributions of right- 
circular (red) and left-circular (green) polarized waves. b, Chiral media are optically active — they 
impose an intrinsic anisotropy on linearly polarized light. Here, the electric-field vector oscillates 
in a plane defined by the blue arrow. The phenomenon known as circular birefringence (CB) causes 
one component of circular polarization to propagate faster than the other, so that the plane of the 
electric-field vector rotates by an angle ©. c, In circular dichroism (CD), one component of circular 
polarization is absorbed more strongly than the other, leading to net polarization ellipticity at an 
angle @. Rhee et al.” report a technique for acquiring information about the optical activity of chiral 
compounds on a femtosecond timescale, in principle opening the way to ultrafast time-resolved 
studies of chirospecific chemical and biochemical processes. 


of circular birefringence (CB, Fig. 1b), the 
observable effect of which is the rotation of 
linearly polarized light from its original angle; 
the differential attenuation (Ax = x, — Kp) causes 
circular dichroism (CD, Fig. 1c), in which R- 
and L-polarized light are absorbed unequally. 

The two enantiomeric forms ofa chiral com- 
pound display optical activities (An and Ax) of 
equal magnitude, yet opposite sign, reflecting 
the chirospecific nature of their interactions 
with oscillating electric and magnetic fields. 
The patterns obtained by recording CB or CD 
as a function of wavelength thus afford a spec- 
tral ‘fingerprint’ for a given enantiomer and its 
environment. The majority of such measure- 
ments rely on electronic excitations caused 
by radiation in the visible and ultraviolet 
regions of the spectrum. But analogous meth- 
ods involving infrared light — which excites 
molecular vibrations — have also emerged as 
powerful probes of molecular structure and 
function’”. Rhee et al. build on the techniques 
of vibrational circular dichroism (VCD)*”, 
in which the unique chiroptical behaviour 
displayed by each vibrational feature of an 
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infrared absorption spectrum conveys infor- 
mation about the configuration and conforma- 
tion of the molecule under investigation. 

Unfortunately, the chiroptical response 
of an isotropic solution is exceedingly small, 
and so any viable probe of CD must be able 
to distinguish a minuscule chiral absorption 
signal from a substantially larger background 
of achiral absorption. For VCD (where the 
ratio of signal to background absorption is 
typically 10°-10°°), exquisitely sensitive detec- 
tion schemes are needed to isolate the slight 
differences in sample absorption revealed 
by alternating bursts of R- and L-polarized 
infrared light. Even when the most efficient 
data-collection methods are used (such as 
Fourier-transform infrared spectroscopy’), 
hours of continuous averaging are often 
required to produce statistically meaningful 
VCD spectra, thus limiting the usefulness of 
the technique for time-resolved studies. 

But Rhee et al.” have developed a strategy 
that permits both vibrational CB (known as 
vibrational optical rotatory dispersion, VORD) 
and VCD to be recorded without the need to 
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differentially discriminate minute signals. In 
their technique, a short infrared pulse is used to 
coherently excite vibrations in a dissolved chi- 
ral compound, thereby creating a macroscopic 
assembly of oscillating dipoles. This induced 
‘polarization radiates the vibrational properties 
of the target molecules in the form of a signal 
known as a free-induction decay. This signal 
has chirospecific information encoded in the 
amplitude and phase of its electric field — in 
the same way that the characteristic sound of a 
ringing bell reflects its size and shape. 

So howis the weak chiroptical signal discrim- 
inated from the strong achiral background? 
The authors rely on the fact that linearly polar- 
ized light always develops another polarized 
component (perpendicular to the original 
direction of polarization) as it crosses a chiral 
medium (Fig. 1b, c). Using polarization analys- 
ers of exceptional quality aligned to detect only 
signals orthogonal to the impinging (linearly 
polarized) infrared beam, the authors were able 
to isolate the desired optical-activity response 
in their experiments. 

To amplify and interpret the responses, Rhee 
et al. exploited the phenomenon of interfer- 
ence, in which the superposition of oscillating 
waves gives rise to distinctive patterns of new 
waves. They thus combined the free-induction 
decay from the chiral sample with an infrared 
reference pulse of much greater intensity. The 
coherently beating portion of the resultant 
pattern — the product of the weak signal field 
and the strong reference field — can be iso- 
lated and analysed using Fourier-transform 
manipulations’, yielding information about 
the sample. 

Ultimately, the authors were able to ascertain 
a property of chiral molecules known as the 
complex susceptibility. This quantity consists of 
a part that is related to An and another part that 
is related to Ax. In this way, VORD and VCD 
spectra can be acquired for each incident pulse 
of infrared radiation, with the timescale of each 
measurement essentially being given by the 
pulse duration (less than 100 femtoseconds). 

Rhee et al. validated their approach by exam- 
ining the enantiomers of a simple chiral mol- 
ecule (limonene) dissolved in solution. They 
showed that the resulting spectra match those 
obtained by conventional VCD spectrometers 
and agree with theoretical VCD predictions. 
In principle, the techniques and concepts 
introduced in this work’ should be applicable 
to a variety of chiroptical studies on systems 
in equilibrium, including those based on 
electronic (rather than vibrational) phenom- 
ena, provided that polarization analysers of 
sufficient quality are available for the targeted 
spectral region. 

A time-resolved analysis of optical activity 
has yet to be performed, but the present results 
establish the basic principles required for stud- 
ies in which changes in chirality induced by an 
abrupt pulse of ‘pump light can be monitored 
at subsequent points in time using a synchro- 
nously delayed infrared ‘probe’. The resulting 
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arrays of VORD and VCD spectra should allow 
an unprecedented glimpse of the ultrafast 
structural and conformational changes that 
accompany chirospecific chemical and bio- 
chemical transformations, such as those that 
occur during asymmetric catalysis or protein 
folding. Once the practical details are sorted 
out, the successful execution of time-resolved 
measurements will revolutionize our ability to 
interrogate chiral molecules and their inter- 
actions, in much the same way that the advent 
of high-speed photography overcame the 
limitations of visual perception. a 
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CANCER 


The nuances of therapy 


Lee M. Ellis and David A. Reardon 


Oncologists use drugs that limit a tumour's blood supply to prevent its 
growth. Although the initial effects of these drugs are beneficial to patients, 
new data suggest that their long-term effects warrant further study. 


Angiogenesis — the formation of new blood 
vessels — is a hallmark of cancer, and allows 
tumour growth. Anti-angiogenic therapy 
offers great promise and is often used to treat 
cancer, either alone or in combination with 
chemotherapy. But, like all anticancer thera- 
pies, agents that inhibit tumour angiogenesis 
are prone to either intrinsic or acquired resist- 
ance. Paez-Ribes et al.' and Ebos et al.” show in 
two preclinical (in vitro and animal) studies'” 
published in Cancer Cell that, depending on 
treatment conditions, anti-angiogenic therapy 
could theoretically increase the likelihood of 
tumour invasiveness and spread. 

One protein with a central role in pro- 
moting angiogenesis is vascular endothelial 
growth factor (VEGF), and anti-VEGF agents 
are therefore commonly used to treat cancer. 


Figure 1| The MRI 

evidence. a, In agreement 
with the latest preclinical 
data'”, MRI scans from 

a patient with recurrent 
glioblastoma show that, 

after treatment with the 
VEGF-neutralizing antibody 
bevacizumab and the 
chemotherapeutic agent 
irinotecan, the macroscopic 
‘enhancing’ tumour 
disappears, consistent 

with a complete response. 

b, However, microscopic 
tumour infiltration to other 
brain regions (arrows) is 
detectable in the patient after 
this therapy, using a different 
type of MRI that highlights 
brain inflammation and 
swelling". 


Before therapy 


On therapy 
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Nonetheless, as highlighted by several studies, 
host and tumour responses to loss of VEGF- 
mediated signalling can be complex’. 

In mice, deletion of a single copy of the 
VEGF gene causes embryonic death‘, suggest- 
ing that it is essential for survival. Loss of this 
gene specifically in endothelial cells, which 
line blood vessels, increases the probability of 
angiogenesis-related disease conditions such 
as thrombosis, haemorrhage and fibrosis°. 
What's more, VEGF inhibition affects not only 
tumour vasculature but also healthy host tissues. 
When VEGEF activity is impaired in tumour- 
free mice, for example, compensatory pathways 
are activated that in theory could augment 
tumour invasion and metastasis in patients 
with cancer. Ebos et al. have previously shown® 
that anti- VEGF therapy increases levels of 
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several blood-plasma proteins, including 
G-CSF, PIGF and SDF1, that may support 
invasion and metastasis by tumour cells. 
Response to oxygen shortage may also lead to 
a compensatory programme mediated by the 
mobilization of cells derived from the bone 
matrow, increasing tumour invasiveness and 
angiogenesis’. 

The practical issue that Paez-Ribes et al.' and 
Ebos and colleagues” now raise through pre- 
clinical studies is whether anti- VEGF therapy 
could shorten patients’ long-term survival. 
Paez-Ribes et al. studied the effects of either 
anti-angiogenic drugs or genetic ablation of 
the gene encoding a VEGF protein in mouse 
models of two cancers: glioblastoma, the 
deadliest of brain cancers, and pancreatic 
neuroendocrine tumours. They find that, 
although these treatments have antitumour 
effects, loss of VEGF signalling also increases 
tumour invasiveness and metastasis. 

Ebos et al.” reached similar conclusions 
when they treated mice with VEGF-recep- 
tor inhibitors. Intriguingly, they found that 
whether these drugs prevented tumour growth 
or increased the likelihood of metastasis 
depended on tumour stage and the duration of 
treatment. Whereas sustained use of the drugs 
on established tumours significantly inhibited 
tumour growth, short-term treatment before 
the implantation of tumour cells into mice 
‘pre-conditioned’ the animals for accelerated 
metastasis. 

How do these findings relate to previous 
clinical observations? From experience in 
the clinic, single-agent anti-VEGF therapy 
is ineffective for treating non-small-cell lung 
cancer, and metastatic breast and colorectal 
cancers. However, anti- VEGF agents — spe- 
cifically, bevacizumab, a VEGF-neutralizing 
antibody — improve the effects of chemo- 
therapy in patients with these tumour types’. 
It is likely that such effects of combined ther- 
apy are due to complex interactions between 
anti- VEGF and chemotherapeutic agents, 
and so are not directly comparable with the 
preclinical data’, 

In addition, a study* that analysed data from 
a prospective database of more than 1,900 
patients with metastatic colorectal cancer indi- 
cated that the overall survival of those given 
bevacizumab throughout their treatment, in 
combination with various chemotherapeutic 
agents, was much better than the results of any 
clinical trial in which anti- VEGF therapy was 
stopped after initial disease progression — 
tumour growth despite therapy. This observa- 
tion, combined with the data alluded to above, 
suggests that the preclinical findings of Paez- 
Ribes et al. and Ebos et al. should not dissuade 
oncologists from using anti- VEGF therapy in 
combination with chemotherapy. However, 
these studies'” may be more relevant to patients 
with a form of liver cancer called hepatocellu- 
lar carcinoma, and possibly those with kidney 
cancer”"”. Single-agent anti-VEGF therapy in 
both of these patient groups improves survival 


with no clinically evident deleterious effects. 

Investigators are currently awaiting the 
results of several clinical trials in patients at risk 
of harbouring undetectable metastatic tumours 
and who receive chemotherapy together with 
bevacizumab in the adjuvant setting, in an 
attempt to prevent these ‘micrometastases’ 
from growing. In particular, the outcome of 
a trial in which patients with colorectal car- 
cinoma received six months of combination 
therapy followed by six months of bevacizumab 
alone is expected soon. Results gathered from 
such studies should be evaluated not only for 
cancer-progression-free and overall survival 
rates, but also for the pattern of tumour recur- 
rence — that is, whether patients with cancer 
recurrence demonstrate widespread metas- 
tases, as seen in the preclinical studies’. 

Observations in patients with recurrent 
glioblastoma who were treated with anti- VEGF 
therapy with or without the chemotherapeutic 
agent irinotecan seem to support some of the 
findings of Paez-Ribes et al. and Ebos and col- 
leagues. Unlike historical salvage therapies, 
which achieve low response rates of nominal 
duration, most patients with recurrent glio- 
blastoma respond to anti- VEGF therapy, as is 
evidenced by both improved MRI (magnetic 
resonance imaging) scans and improvement 
in their neurological deficits. What’s more, in 
many, these gratifying responses are meaning- 
fully durable'!. Nevertheless, most patients 
become resistant to anti- VEGF therapy, show- 
ing marked infiltration of tumour cells, and with 
some tumours spreading to other regions of the 
brain, such as the contralateral hemisphere, 
brainstem and leptomeninges (Fig. 1); progres- 
sion is also often quick and rapidly fatal”. 

This observation supports the findings of 
Paez-Ribes et al.!and Ebos et al.”, and together 
they indicate that, although anti- VEGF therapy 
may meaningfully improve the therapeutic 
outcome for some patients, resistance to this 
therapy can develop and may lead to more 
aggressive and widespread disease. These data 
therefore highlight the point that oncologists 
must carefully scrutinize clinical data, not 
only focusing on short-term endpoints, such 
as response rates and time to progression, but 
also patterns of failure, overall survival and 
quality of life. 

On the basis of these findings'*", should 
anti- VEGF therapy be limited in duration? As 
noted in the preclinical setting’, even a short 
duration of anti- VEGF therapy can increase 
the likelihood of invasion and metastasis by 
tumour cells; yet in the clinic, acceleration of 
tumour growth and/or metastasis is rarely (if 
ever) noted after initiation of therapy. Some 
investigators have noted a ‘tumour flare’ on 
cessation of anti-VEGF therapy, but these 
observations are anecdotal. One approach to 
resolving this dilemma is to identify increases 
in the levels of soluble markers or mediators 
of tumour growth and/or metastasis just 
before starting anti- VEGF therapy. Indeed, an 
increase in levels of PIGF and HGF — potential 
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50 YEARS AGO 

Television and the Child: An 
Empirical Study of the Effect 

of Television on the Young. By 
Hilde T. Himmelweit, A. N. 
Oppenheim and Pamela Vince 
— The widespread interest 
aroused by this report is easily 
understood. It is the first study of 
its kind to be conducted among 
English-speaking peoples and, 
indeed, probably in the world... In 
general, television does not cause 
listlessness, loss of sleep, bad 
dreams, lack of concentration, 
or eye strain. Children are not 
made more passive or become 
more aggressive, or kept away 
from youth clubs by television. 
Against this are a number of 
conclusions which, on balance, 
suggest that television, as at 
present administered, has 

more harmful than beneficial 
influence on children... The main 
recommendation made by Dr. 
Himmelweit and her colleagues 
is that the B.B.C. and I.T.A. must 
collaborate much more over the 
balance of programmes. 

From Nature 21 March 1959. 


100 YEARS AGO 

Ina letter to the Times of 

March 15, Prof. Osler directs 
attention to the useful work 
which is being done by the 
Italian Society for the Study 

of Malaria, founded ten years 
ago, for the prevention of 
malarial diseases. The society 
has promoted legislation for 
the gratuitous distribution of 
quinine, has prepared quinine 
in its most agreeable forms, and 
has introduced into practice the 
mechanical measures based on 
the defence of the habitation and LE] 
the individual from the bites of 
mosquitoes. The result is that the > 
mortality from malaria in Italy has 
declined from 16,000 in 1902 to 
about 4000 in 1908... the growth 
of our knowledge of the causation 
and prevention of malaria 
illustrates the stages through 
which so many of the great 
discoveries in medicine have had 
to pass, and is a striking example 
of the value of experimental 
methods in medical research. 
From Nature 18 March 1909. 
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mediators of resistance to anti-VEGF therapy 
— just before the resumption of growth of 
colorectal tumours and their metastases has 
been reported for this approach”’ in patients 
receiving chemotherapy plus bevacizumab. 
Finally, although it was initially hypothesized 
that anti-angiogenic therapy would induce 
tumour dormancy, this is clearly not the case 
in the clinic. Thus, following an initial favour- 
able response to chemotherapy combined with 
anti- VEGF therapy, single-agent anti- VEGF 
therapy should not be used until there are data 
from clinical trials to either support or refute 
this approach. a 
Lee M. Ellis is in the Departments of Cancer 
Biology and Surgical Oncology, The University 
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GEOPHYSICS 


Hot blanket in Earth's deep crust 


Jean Braun 


Studies of rocks from Earth's crust suggest that the lower crust is a good 
thermal insulator. The knock-on effects of this finding are many — one 
being the crust’'s increased potential to generate more magma. 


We know little about the temperature of the 
deepest parts of Earth’s continental crust, 
which can be 30-40 kilometres below the 
surface. We cannot measure temperature 
directly beyond a few kilometres down, at the 
bottom of deep mines or drill holes. Yet tem- 
perature governs many geological processes, 
including the generation of magmas through 
the melting of rocks in the lower crust and 
the mechanism by which rocks deform. In 
the crust, heat from the underlying mantle is 
mainly transported by conduction. The effi- 
ciency of the process is regulated by the crust’s 
thermal conductivity: a high conductivity 
results in efficient heat flow and a relatively low 
temperature at the base of the crust; conversely, 
a low conductivity results in poor conduction 
of heat and a high temperature at the base of 
the crust — the lower crust is a good insulator 
and thus acts as a blanket over the mantle. 
Like most materials, rocks become less effi- 
cient at transporting heat (their conductivity 
decreases) as temperature increases, especially 
in the vicinity of their melting temperature. 
But measuring thermal conductivity at high 
temperatures is difficult because, close to the 
melting temperature, heat is transported not 
only by conduction but also by radiation. This 
implies that most existing high-temperature 
rock-conductivity measurements have been 
overestimated. Consequently, in many applica- 
tions, rock conductivity is assumed to be con- 
stant throughout the crust. Whittington et al.' 
(page 319 of this issue) now provide more accu- 
rate data on a variety of crustal rock samples 
that demonstrate a strong dependence of rock 
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thermal conductivity on temperature. This 
means that the conductivity of rocks at tem- 
peratures existing in the lower crust is much 
lower — by as much as 50% — than previously 
recognized. The implication is that the lower 
crust may be hotter than we thought (Fig. 1). 

By using a recently developed technique 
known as laser-flash analysis, which consists of 
subjecting one end of a small (10 mm x 1 mm) 
rock sample held at some temperature in a hot 
furnace to a laser pulse of heat, Whittington 
and colleagues were able to measure the time 
taken for the heat to reach the other end. They 
then used a carefully calibrated model of heat 
transport in the sample to extract accurate 
estimates of its thermal conductivity. 

The authors’ observation’ that rock con- 
ductivity varies strongly with temperature, 
and the related realization that the lower crust 
has a lower conductivity and is thus warmer 
than we thought, has far-reaching implications 
for our understanding of fundamental mag- 
matic and tectonic processes (Fig. 2). The con- 
ductivity of a rock is more correctly described 
by its thermal diffusivity — the ratio of con- 
ductivity to the product of the heat capacity 
(the ability to store heat) and density. It is 
this that controls the rate at which the crust 
responds to a perturbation in temperature such 
as is caused by the injection of magma from 
the underlying mantle or by ‘strain heating’ in 
shear zones (Fig. 1). 

Reduced conductivity in the lower crust 
would thus prevent the dissipation of the heat 
brought into the crust by intrusion of basaltic 
rock from the mantle; temperatures within and 
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Figure 1| Thermal conductivity and temperature 
in Earth's crust. The temperature profile of 
Earth’s crust is shown for two scenarios of 
conductivity for a crust subjected to a constant 
basal heat flux: uniform rock conductivity 
(thin blue line) and variable rock conductivity 
(thick blue line). Whittington et al.’ show that 

a non-uniform conductivity across the crust 
results in an increase in basal temperature, as 

is indicated by the arrow. The shaded areas 
show the difference in the crust’s response to 

an instantaneous perturbation in the basal heat 
flux, such as that caused by a basaltic intrusion 
or shear heating: the light-blue area denotes the 
resulting change in temperature for a uniform 
conductivity; dark blue is for variable 
conductivity. In the latter case, the reduced 
conductivity of the lower crust decreases 

the upward dissipation of the thermal event 
and enhances heating of the lower crust. The 
red lines show the two conductivity profiles. 
The depth scale spans the entire crust (from the 
surface down to 35-40 km); the temperature 
scale varies between 0 °C and an upper range 

of 500-800 °C (this upper limit depends on the 
assumed contribution from heat production due 
to radioactivity). 


around the intrusion should be higher than we 
currently predict, and we should expect more 
rapid and efficient melting of the lower crust 
for the same amount of heat injected from the 
mantle. This effect has important consequences 
for our understanding of arc or andean-type 
volcanism (that is, volcanism associated 
with subduction zones). Lower conductivity 
of magma also helps its ascent through the 
crust, because it slows down the cooling 
and ‘freezing that occur by contact with the 
surrounding, colder rocks. 

The deformation of the crust in response to 
tectonic forces is commonly accompanied by 
strain localization: that is, by the formation of 
narrow, ductile shear zones. During deforma- 
tion at high stress levels, mechanical energy 
is transformed into heat (a process called 
strain heating), which is dissipated from the 
shear zone by conduction. A reduced ther- 
mal conductivity at high temperature would 
limit heat dissipation, resulting in much hot- 
ter shear zones than commonly assumed. As 
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Whittington et al.' demonstrate, this may even 
lead to melting inside the shear zone, causing 
further localization of the deformation within 
the shear zone. But this result should be inter- 
preted with caution, as strain heating depends 
strongly on the evolution of stress within the 
zone, which itself depends on the assumed 
deformation mechanism and is currently the 
subject of much debate’. Enhanced strain heat- 
ing could also explain the formation of larger 
than expected volumes of granitic magmas 
produced by partial melting of deep crustal 
rocks*. This process, referred to as anatexis, 
is observed in many mountain ranges, most 
notably in the granitic belts associated with the 
Main Central Thrust fault in the Himalayas. 

A recent and vigorously debated model for 
the evolution of large, hot mountain belts, such 
as the Himalayan-Tibetan system’, relies on 
the formation of a ductile, partially molten 
lower crust that may be extruded at Earth’s sur- 
face in regions of high precipitation and thus 
surface erosion. The high-temperature condi- 
tions necessary for partial melting of the lower 
crust are thought to be achieved by increased 
heat production due to radioactivity, resulting 
from a thickened crust and/or shear heating. 
A reduced conductivity in the lower crust 
would significantly reduce the time necessary 
for such radiogenic heating to take place, and 
would potentially lead to higher crustal tem- 
peratures. This hypothesis, and many others 
concerning the dynamics of mountain belts, 
should now be revisited in view of Whittington 
and colleagues’ new measurements’ of thermal 
conductivity. 

Also in need of reassessment are most quan- 
titative models of Earth’s dynamical behaviour, 


5 Channel of 
molten rock ~~ 


Figure 2 | Implications of a lower crust that is warmer than expected. Whittington and colleagues’ 
observation’ that the lower crust is a good thermal insulator and is thus warmer than previously 
recognized has many consequences for magmatic and tectonic processes in convergent plate settings 
such as subduction zones: 1, more efficient melting and mixing of the lower crust following mantle- 
derived basaltic intrusion; 2, more rapid and efficient ascent of the resulting magma through the crust; 
3, enhanced strain heating during tectonically driven deformation of the lower crust; 4, enhanced 
strain localization; 5, quicker development of a lower-crust channel of molten rock that may be 
extruded at Earth’s surface in regions of high precipitation and thus surface erosion. 


because heat transport is such an impor- 
tant process inside the Earth. For example, a 
reduced crustal conductivity would also imply 
a higher mean temperature in the underlying 
mantle, especially during the early, ‘hotter’ 
stages of the planet’s evolution. This, in turn, 
has direct implications for our understanding 
of the early Earth's differentiation and the dis- 
tribution of elements in its various reservoirs 
(the crust, mantle and core). a 
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DINOSAURS 


Fuzzy origins for feathers 


Lawrence M. Witmer 


Cretaceous fossil deposits in China are famous for their feathered 
dinosaurs. But the surprising discovery of a herbivorous dinosaur with a 
filamentous coat raises fresh questions about the evolution of feathers. 


Liaoning Province in northeastern China is 
renowned for the fossils that document, in 
often vivid detail, virtually the entire biota 
that lived over a period of several million 
years during the Early Cretaceous (about 
125 million years ago)’. Although exquisite 
fossils of diverse vertebrates, invertebrates 
and plants have been recovered, it’s the 
spectacular feathered dinosaurs that have 
received most attention” * and caused much 
controversy”®. On page 333 of this issue, Zheng 
and colleagues’ present the discovery of a small 
dinosaur, Tianyulong confuciusi, from the 
Yixian Formation of Liaoning, that promises 
to send the debate on dinosaur feathers in 


a totally new direction — and a confusing 
direction, at that. 

Even without preservation of portions of 
its skin, Tianyulong would be a notable find, 
because its genealogical ties are to a group of 
herbivorous dinosaurs, called heterodonto- 
saurids, that had undergone their evolution- 
ary radiation 70 million years earlier, making 
Tianyulong a ‘living fossil’ in its own time. 
Heterodontosaurids used to be regarded as a 
fairly obscure group, related to the more famous 
duck-billed hadrosaurs. But recently, hetero- 
dontosaurids have taken centre stage as the 
most evolutionarily basal branch of the entire 
great radiation of herbivorous dinosaurs, the 
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Ornithischia (Fig. 1, overleaf), that included 
not only hadrosaurs but also Triceratops, Stego- 
saurus anda host of related animals®. Were it not 
for its skin, Tianyulong would be important as a 
late-surviving twig of this branch of the ornithis- 
chian family tree, and the first of its kind known 
from Asia. But the fossils of Tianyulong, splayed 
on a stone slab, reveal three patches of long 
filaments reminiscent of structures thought to 
be the evolutionary progenitors of feathers. The 
only problem is that Tianyulong isn't supposed 
to have anything like feathers. 

Before the 1990s, life was simple: feathers 
were thought to be an exclusively avian 
attribute, found in all birds today and extend- 
ing back to the iconic Archaeopteryx in the Late 
Jurassic, some 150 million years ago. The dis- 
covery of very bird-like feathers, complete with 
shaft and vanes (pennaceous feathers), in some 
of the predatory theropod dinosaurs found in 
Liaoning (such as Caudipteryx and Micro- 
raptor) rocked the scientific world™*”, because 
the feathered dinosaurs were outside the evo- 
lutionary group of acknowledged birds. Still, 
it wasn’t completely unexpected, in that these 
dinosaurs are representatives of the theropod 
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Maniraptora 


Theropoda 


Dinosauria 


Figure 1| Dinosaur relationships and skin characteristics. The dichotomy between feathered birds 
and scaly reptiles was demolished by the discovery of true pennaceous feathers in the non-avian 
maniraptoran dinosaurs thought to be closest to birds. More controversial have been the filamentous 
skin structures, variously regarded as external ‘protofeathers’ or internal structural fibres. Zheng and 
colleagues’ discovery’ of filamentous structures in Tianyulong further complicates the picture, in 
that this is an ornithischian dinosaur far removed from the ancestry of birds. Many other dinosaurs, 
such as other ornithischians and basal saurischians, had reptilian, scaly skin. So, were the ancestral 
dinosaurs fuzzy animals cloaked in ‘protofeathers, which were subsequently lost multiple times in 
later groups? Or were dinosaurs primitively scaled, and did later groups independently evolve wispy, 


feather-like or even bristly skin coverings? 


group, Maniraptora, that is skeletally the most 
bird-like, and from which birds are widely 
thought to have evolved. Feathers became just 
one more character showing that link, albeit 
a compelling one. But Tianyulong is not at all 
closely related to birds and, as a heterodonto- 
saurid ornithischian, is on an entirely separate 
branch of the dinosaur family tree (Fig. 1). 
And indeed, Tianyulong doesn’t have true 
pennaceous feathers. It has long filaments, very 
similar to what have been called ‘protofeathers’ 


CHEMISTRY 


Thinking outside the flask 


or, more non-committally, ‘dinofuzz. These 
filaments are evident in some theropods such 
as Caudipteryx that have true pennaceous 
feathers, but are also found in a range of other 
theropods that lack definitive feathers, such 
as the basal coelurosaur Sinosauropteryx, the 
therizinosauroid Beipiaosaurus and the basal 
tyrannosauroid Dilong’*”’. 

Herein lies the controversy. No one disputes 
that these filaments are integumentary (in the 
skin). The question is, from what part of the 


skin do they come — the outside or the inside? 
Many**""' have regarded them as epidermal 
(that is, as projecting, external appendages that 
are somehow evolutionarily related to feath- 
ers). Others”® have regarded them as dermal 
(that is, as the remains of collagen fibres below 
the skin’s surface). In this context, the differ- 
ence between epidermal and dermal is huge. If 
they are epidermal, then they bear not only on 
feather evolution and avian origins, but also on 
metabolic physiology, behavioural display and 
flight. If they are dermal, then they're ultimately 
structural in function and have little bearing on 
those other issues. Unfortunately, the rhetoric 
of this debate has overshadowed the scientific 
evaluation of evidence’. If the integumentary 
filaments of Tianyulong are dermal (colla- 
gen fibres), then they become interesting but 
not of monumental importance. However, 
if they are epidermal, then they take on great 
significance. 

Given the position of Tianyulong near the 
evolutionary base of ornithischian dinosaurs, 
the presence of epidermal, filamentous, feather- 
like structures could mean that the ancestral 
dinosaur was a fuzzy (though maybe not 
cuddly) animal. Of course, that would also 
mean that a fuzzy coat of protofeathers was lost 
many times in dinosaur evolution, because lots 
of dinosaur groups on both great branches of 
the dinosaur family tree are known to have scaly, 
reptilian skin (Fig. 1). But, before complicated 
scenarios for feather evolution are concocted, 
the fundamental question to be answered is 
whether the filaments of Tianyulong are on the 
outside or inside of the skin’s surface. 

That seemingly simple question is sur- 
prisingly hard to answer. The obvious test 
would be a biochemical or molecular assay to 
find out if the filaments are composed of the 
feather protein keratin or the collagen protein, 
but the mode of fossil preservation may not 


because polymers can't easily 
diffuse through the nanofibre 
matrix). Furthermore, the products 
can be analysed directly within 


The present credit crunch is 
forcing everyone to save money, 
and chemists are no exception. A 
good cost-cutting measure is to 
perform reactions on a small scale, 
thereby reducing the outlay on raw 
materials and minimizing the energy 
required to drive the reactions. 
Reporting in Nature Chemistry, 
Anzenbacher and Palacios describe 
the ultimate in miniature reaction 
vessels — junctions formed when 
two polymer nanofibres are fused 
together (P. Anzenbacher Jr & M.A. 
Palacios Nature Chem. doi:10.1038/ 
nchem.125; 2009). 

The authors prepared nanofibres 
— each hundreds of times narrower 
than a human hair — from readily 
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available polymers, and loaded them 
with various chemical reactants. 
They then laid fibres containing 
different reactants across each 
other and exposed them to either 
heat or solvent vapour. This caused 
the fibres to fuse together, forming 
junctions that defined discrete 
chemical reactors with attolitre-scale 
volumes (1 attolitre is 10“ litres). 
The junctions contained as few as 
1,500 molecules of each reactant. 
To illustrate the principle of their 
ultra-small reactors, Anzenbacher 
and Palacios doped fibres with two 
non-fluorescent compounds that 
form a fluorescent product when 
they react. When the two types 
of fibres were overlapped to form 


arandom mat and then heated, 
fluorescence at the fused junctions 
clearly indicated the formation of 
the reaction product (pictured). 
The authors showed that 
several types of reactions can be 
performed in their attoreactors, 
including those in which one of 
the reactants is polymeric 
(although reactions with 
two polymeric reactants are 
expected to be problematic, 
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the junctions using fluorescence 
measurements or mass spectrometry. 
Several applications for these 
attoreactors suggest themselves. 
For example, libraries of nanofibres 
could be prepared in which each 
nanofibre is loaded with a different 
compound from the same chemical 
class. Selected libraries could then 
be reacted with each other, as in 
combinatorial chemistry, to prepare 
many different products quickly 
and easily. If the products could 
be screened directly for biological 
activity, this would be useful for 
the high-throughput preparation 
and testing of compounds in 
drug-discovery programmes. 
Stephen Davey 
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permit a meaningful test. That leaves observable 
structure, and here the best indicator would 
be whether the filaments are tubular or solid 
in cross-section. Feathers are hollow, tubular 
structures, and developmental models predict 
that ‘protofeather’ filaments should also be 
hollow". By contrast, collagenous filaments 
should be essentially solid. 

Zheng and colleagues’ interpret the filaments 
of Tianyulong as being hollow, although it’s fair 
to question their evidence (longitudinal stripes 
on the filaments). But other attributes of the 
filaments are also highly suggestive of their 
being epidermal. For example, the filaments 
associated with the base of the tail are extremely 
long, and, given that the tail is already 
reinforced internally with stiffening rods of ossi- 
fied tendons, it is possible that these filaments 
indeed project outside the skin’s surface. Even 
if they can be shown to be definitively epider- 
mal, the ultimate question is whether they are 


part of the evolutionary lineage of true feathers 
or an independent evolution of projecting 
epidermal appendages. Certainly, the find- 
ing of very differently structured, projecting, 
hollow ‘bristles’ on the tail of another Yixian 
ornithischian, Psittacosaurus (a basal horned 
dinosaur)”, raises the possibility that there may 
bea range of filamentous epidermal structures 
in dinosaurs, and that not all such structures 
may be related evolutionarily to feathers”. 
Perhaps the only clear conclusion that can 
be drawn from the foregoing is that little Tian- 
yulong has made an already confusing picture 
of feather origins even fuzzier. Such an out- 
come is common in palaeontology. But the 
prospects of new fossils, new molecular and 
imaging techniques (such as synchrotron tom- 
ography), and even new ideas, offer the hope of 
bringing the evolutionary picture into sharper 
focus — and that picture may well end up being 
of fuzzy dinosaurs. a 
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GLOBAL CHANGE 


West-side story of Antarctic ice 


Philippe Huybrechts 


During the past five million years, the West Antarctic ice sheet has waxed 
and waned in size. A two-pronged reconstruction of that history provides 
clues to the ice sheet's future behaviour. 


For human societies, the prospect of sea-level 
rise is probably the most serious long-term 
threat from unabated climate warming. Both 
the Greenland ice sheet and the West Antarc- 
tic ice sheet are believed to be vulnerable to 
warming at the levels projected for the com- 
ing centuries. Sustaining those levels for many 
more centuries or millennia could ultimately 
cause the demise of both ice sheets — pro- 
ducing a worldwide rise in sea level of about 
12 metres compared with today’s levels, of 
which some 5 metres would derive from West 
Antarctica’. 

The past can be a guide to the future. But the 
late Quaternary (the past 0.5 million to 1 mil- 
lion years), for which we have the most data, 
was generally colder than today, and so is not an 
ideal analogue for the future. The twin papers 
by Naish et al.” and Pollard and DeConto’, pub- 
lished in this issue, now present a step towards 
filling this gap in data and understanding. In 
their reconstruction of Antarctica’s glacial his- 
tory, the authors concentrate on the early to 
middle Pliocene, an interval of time between 
5 million and 3 million years ago, when plan- 
etary temperatures were more in the range of 
those projected for the coming centuries. 

The suspected vulnerability of the West Ant- 
arctic ice sheet stems from its particular setting. 
Itis grounded mostly below sea level and is sur- 
rounded by large floating ice shelves (Fig. 1). 
These floating extensions are in direct contact 


Ice sheet 


Grounding 
line 


Ice shelf 


Figure 1| The West Antarctic ice sheet. This 
much simplified depiction shows, to the left, the 
grounded marine ice, which sits upon bedrock or 
sediment on the sea floor. To the right, separated 
by the ‘grounding line’ is the floating ice shelf, 
which is thought to buttress the grounded 

ice sheet. The new observational study’ and 
modelling work’ identify a 40,000-year cycle 

in which the grounding line moves back and 
forth across the sea floor between glacial 

and interglacial states, punctuated by periodic 
ice-sheet collapses with more or less open-ocean 
conditions during super-interglacials. 


with the ocean and are widely believed to have 
acrucial role in keeping the grounded ice sheet 
in place. At present, Antarctic temperatures 
are too low to generate significant surface 
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melting during the summer, and this will 
remain so even with moderate warming. Some 
combination of sea-level changes and changes 
in sub-ice-shelf melting must therefore be a key 
process by which variations in global climate 
control changes in the West Antarctic ice sheet. 
A long-standing debate among glaciologists 
has been the mechanism by which changes to 
the ice shelf are transmitted inland to grounded 
ice sheets across their common boundary, the 
‘grounding line” (Fig. 1), and how effective this 
mechanism might be. 

Naish and colleagues’ observational evidence 
(page 322)? comes from a new sediment core 
recovered from beneath the Ross Ice Shelf near 
the current ice-shelf margin. The site turned 
out to be a strategic location. The range of 
sediments and rock types and other features 

seen in the core allowed the authors to dis- 

tinguish between a cyclical succession of 

conditions at the drill location: open ocean 

with little or no summer sea-ice (super- 

interglacial conditions warmer than those 

of today); coverage by a floating ice shelf 
(interglacial conditions much like today’s); 
and overriding by an ice sheet grounded on 
the sea floor (glacial conditions). Moreover, 
unlike many other glacial records in which 
each glaciation has erased the evidence of the 
previous one, this record stands out in having 
large sections without such hiatuses because 
tectonic forces have produced a favourable rate 
of bedrock sinking. 

Dating of undisturbed sections along the 
core enabled Naish et al. to identify 40 sedi- 
mentary cycles, each of about 40,000 years’ 
duration, during the Pliocene (up to 1.8 million 
years ago). These results are in good accord- 
ance with the same cyclicity seen in marine- 
isotope records of global ice volume and mean 
deep-sea temperatures’, and are in phase with 
summer half-year insolation caused by varia- 
tions of the Earth’s tilt at the same periodicity. 
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The geological sediment data, however, can 
only provide information on variations of the 
extent of the grounded ice sheet at one particu- 
lar location. They provide no indication of ice 
thickness or, by extension, ice volume, which 
is where the modelling comes in. 

Pollard and DeConto (page 329) used a 
three-dimensional ice-sheet model to simulate 
the evolution of the West Antarctic ice sheet in 
a manner consistent with the geological data. 
A novel feature of their ice-flow model is the 
way the different flow regimes of grounded and 
floating ice are coupled across the grounding 
line, following the results of recent theoreti- 
cal work°. Although approximations remain, 
the incorporation of this work in a three- 
dimensional model is new and is a notable 
methodological contribution to one of glaciol- 
ogy’s grand unsolved problems —the mechan- 
ics of how the West Antarctic grounding line 
migrates over time’. 

The model confirms that the conditions 
reflected in the sediment core are probably 
indicative of the evolution of the West Antarctic 
ice sheet as a whole during the past 5 million 
years. In addition, the model roughly repro- 
duces much of the inferred ice-sheet variations, 
ranging from full glacial extents, to interme- 
diate interglacial states similar to those per- 
taining today, to brief collapses of the entire 
West Antarctic ice sheet during the warmest 
super-interglacials. Unlike Antarctic ice-sheet 
reconstructions produced with broadly simi- 
lar models, including my own’*, Pollard and 
DeConto find a much more prominent role for 
sub-ice-shelf melting than for global sea-level 
variation in driving past migrations of West 
Antarctic grounding lines. 

However, the modelling does not solve every 
problem. The episodic collapses of the West 
Antarctic ice sheet during the past 5 million 
years also happen at times when there is little 
or no evidence for open-ocean conditions or 
high sea levels, adding an erratic component 
to the simulations. Other aspects — such as the 
timing of grounding-line retreat in the Ross 
Sea sector since the Last Glacial Maximum, 
and probably unduly thick ice in this sector 
both then’ and today — seem to contradict the 
observational evidence. This calls for further 
refinements of processes, such as the sliding of 
the ice over its bed, that will increase the cred- 
ibility of the model for supporting predictions 
on shorter timescales. Other issues concern the 
rather simple schemes used to prescribe crucial 
model forcing, such as sub-ice-shelf melting. 
These would best be addressed by more detailed 
global-climate and ocean models than are cur- 
rently possible on these long timescales. 

Even so, the modelling by Pollard and 
DeConto* may already be robust enough to 
attempt to put initial constraints on the amount 
of nearby ocean warming required to generate 
enough sub-ice-shelf melting to initiate a sig- 
nificant retreat of the West Antarctic ice sheet. 
The required ocean warmings, of the order of 
5 °C, may well take several centuries to develop. 
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But such an outcome could result from the 
accumulation of total greenhouse-gas emis- 
sions projected for the twenty-first century, if 
emissions are not greatly reduced’. The implied 
transition time for a total collapse of the West 
Antarctic ice sheet of one thousand to several 
thousand years* seems rapid by Antarctic 
standards. But it is nowhere near the century 
timescales for West Antarctic ice-sheet decay 
based on simple marine ice-sheet models””. m 
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Secret of synapse specificity 


Scott M. Thompson and Hayley A. Mattison 


How does the brain organize all of the information stored in memory? On 
the basis of a state-of-the-art imaging study of neuronal activity in real 
time, the answer seems to be, through specificity in space and time. 


Memories are encoded by a specific pattern 
of activity that is unique to the information 
being processed and stored. Memory forma- 
tion is almost certainly achieved at the syn- 
aptic junctions between neurons through 
the process of long-term potentiation (LTP), 
whereby synaptic communication between 
two simultaneously active neurons becomes 
stronger’”. One of the many attractions of LTP 
for explaining the biological basis of memory 
formation is that it displays remarkable syn- 
apse specificity*’: only synapses that are acti- 
vated become strengthened, and neighbouring 
synapses located only a micrometre or two 
away on the same neuron remain unaffected’. 
In an exciting paper on page 299 of this issue, 
Lee et al.° study living neurons using advanced 
fluorescence microscopy to conclusively dem- 
onstrate the molecular basis of the synapse 
specificity of LTP. 

A crucial enzyme for triggering LTP is 
CaMKIL Release of glutamate, a neurotrans- 
mitter, from the presynaptic neuron leads to 
depolarization of the postsynaptic neuron, 
thus relieving the voltage-dependent block 
of the NMDA-type glutamate receptors on 
the postsynaptic neuron, and allowing cal- 
cium ions (Ca”*) to enter it. The increased 
intracellular Ca™* concentration leads to acti- 
vation of CaMKII, which then phosphorylates 
itself, thereby remaining active even after the 
Ca”*concentration has fallen back to normal. 
Induction of LTP is therefore prevented either 
by NMDA-receptor antagonists or by CaMKII 
inhibitors’. 

To image real-time activation of CaMKII in 
living neurons, Lee et al.° used the technique 
of fluorescence resonance energy transfer 
(FRET)*. In FRET, distance-dependent energy 
transfer from a donor fluorescent molecule 
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(fluorophore) to an acceptor fluorophore 
nanometres away is measured through changes 
in fluorescence intensity. Lee et al. modified a 
‘reporter’ molecule — the CaMKII-based pro- 
tein Camuia’ — that has a donor fluorophore 
attached at one end and an acceptor fluoro- 
phore at the other. On activation of CaMKII, 
the donor-acceptor pair moves apart, lead- 
ing to decreased FRET. The authors further 
improved Camuia by using a bright fluoro- 
phore, green fluorescent protein (GFP), as the 
donor and a non-fluorescent acceptor fluoro- 
phore’®, a mutated GFP called REACh. Thus, 
only changes in the emission of GFP need 
be monitored, allowing emitted photons to be 
collected in a wider spectral window. 

Instead of measuring fluorescence intensity, 
Lee et al. measured the fluorescence lifetime 
of the donor fluorophore’. When CaMKII is 
inactive, REACh at one end of Camuia is close 
enough to quench GFP emission and shorten 
its fluorescence lifetime. With REACh moving 
away upon CaMKII activation, however, the 
fluorescence lifetime of GFP is prolonged 
(Fig. la). That may sound a simple concept, 
but the lifetime of GFP in the excited state 
is less than 2 nanoseconds. Extremely brief 
(<100-femtosecond) bursts of high-energy 
excitation light must therefore be delivered and 
high-efficiency detectors of photon emission 
must be used to measure the time between an 
excitation pulse and the arrival of an emitted 
photon. 

Finally, to achieve adequate spatial resolution, 
as well as to detect as many emitted photons 
as possible, the authors used the technique of 
two-photon microscopy. This approach was 
particularly advantageous because of its supe- 
rior spatial resolution, as the experiments had 
to be performed in relatively thick slices of 
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Figure 1| Synapse-specific long-term potentiation (LTP). Lee et al.° used 
changes in the fluorescence lifetime of GFP bound to Camuia-CaMKII to 
study synapse-specific CaMKII activity. a, When CaMKII is inactive, the 
fluorescence lifetime of GFP is short. b, On photostimulation of NMDA 
receptors and subsequent Ca” influx, there is an increase both in GFP 
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fluorescence — corresponding to CaMKII activation — and in the volume 
of the stimulated spine through LTP. c, Although it takes 10 minutes for 
CaMKII to diffuse out of the spine, the enzyme becomes inactive within 

2 minutes, so that it acts only within the stimulated dendritic spine, thus 
accounting for the synapse specificity of LTP. 


brain tissue for LTP to be induced. The authors’ 
idea was to induce LTP at a single dendritic 
spine — small protrusions on the dendritic 
processes of neurons that are sites of synaptic 
communication and that enlarge in response to 
LTP. To achieve such specificity, Lee et al. used 
photostimulation, whereby ‘caged’ glutamate 
molecules are locally released after exposure 
to high-energy light. 

Having designed their tools so elegantly, Lee 
et al.° were ready to view CaMKII activation 
in single spines. Using patterns of photostimu- 
lation that induce LTP, they observed a sharp 
spike in the fluorescence lifetime of Camuia, 
corresponding to CaMKII activation. Impor- 
tantly, the increase in fluorescence lifetime 
was confined only to the photostimulated 
spine, indicating that CaMKII activation is 
synapse specific (Fig. 1b). The authors could 
prevent the increase in Camuia’s fluorescence 
lifetime using the NMDA-receptor antagonist 
APS, thus showing that CaMKII activation was 
triggered by Ca™ entry via NMDA receptors. 
Remarkably, although the structural and physi- 
ological changes underlying LTP persisted for 
hours, the increased lifetime of Camuia fluores- 
cence — and so the period for which CaMKII 
remains active — lasted less than 2 minutes. 
This period of time is several orders of magni- 
tude shorter than suggested from previous bio- 
chemical measurements, which had indicated 
that CaMKII remains active for hours after 
NMDaA-receptor-mediated Ca” influx’. 

Both Ca** and CaMKII can diffuse freely 
in the cytoplasm. So what accounts for the 
synapse specificity of this enzyme’s activa- 
tion? Why doesn’t locally activated CaMKII 
diffuse to neighbouring synapses and trigger 
LTP there? With a single photostimulus, Lee 
et al.° observed that Ca” levels decay back to 
the resting concentration within a few hundred 


milliseconds. Given the location of excitatory 
synapses on dendritic spines, this is too short a 
time for Ca”* to diffuse to neighbouring spines 
and activate CaMKII there (Fig. 1c). 

The authors found that the spread of acti- 
vated CaMKII is limited not only by its rate of 
inactivation but also by its low mobility. Using a 
photoactivatable GFP variant as a fluorophore, 
they measured the diffusion of CaMKII out of 
the spine through fluorescence decay. After 
photoactivation, fluorescence declined as the 
enzyme diffused out of the synapse. The decay 
took 10 minutes, however, probably because 
CaMKII is anchored to other proteins in the 
spine’. The authors thus conclude that a fast 
rate of CaMKII inactivation, together with its 
much slower rate of diffusion, restricts CaMKII 
activity to the stimulated synapse (Fig. Ic). 

Postsynaptic depolarization in the absence 
of glutamatergic stimulation can also acti- 
vate CaMKII in the spine by opening L-type 
Ca* channels®. These channels, however, do 
not contribute to the depolarization-induced 
increase in bulk Ca” levels. The authors sug- 
gest provocatively that there may be distinct 
pools of CaMKII in the spine that are activated 
by Ca” influx through different routes. Only 
the CaMKII pool activated following NMUDA- 
receptor-mediated Ca™ influx can trigger LTP, 
perhaps because only those CaMKII mol- 
ecules are anchored adjacent to appropriate 
substrates. 

It is important to bear in mind that there 
may be conditions in which LTP does involve 
communication with adjacent synapses. The 
protocol Lee and colleagues used is somewhat 
different from the patterns of high-frequency 
stimulation (100 Hz) that are commonly used 
to induce LTP. Surprisingly, we know little 
about the spatial pattern of intracellular Ca** 
diffusion after such stimulation, let alone about 
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CaMKIl activation. Intracellular Ca”*-buffering 
molecules at the activated synapse are likely to 
become saturated by high-frequency stimula- 
tion patterns, leading to more Ca™ escaping 
from the dendritic spine and, perhaps, to acti- 
vation of CaMKII in neighbouring synapses. 
Indeed, intense activation of NMDA recep- 
tors leads to the spread of CaMKII activation 
along the stimulated dendrites of neurons 
maintained in culture''. Furthermore, induc- 
tion of potentiation at one synapse is asso- 
ciated with an increased probability that 
synapses up to ten micrometres away from the 
stimulated synapse will themselves become 
potentiated, perhaps because of spread of the 
small GTPase enzyme Ras (ref. 5). It will be of 
interest to determine the synapse specificity of 
the key CaMKII substrates and Ras effectors 
that are ultimately responsible for long-term 
memory storage. a 
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BRIEF COMMUNICATIONS ARISING 


Hippopotamus and whale phylogeny 


Arising from: J. G. M. Thewissen, L. N. Cooper, M. T. Clementz, S. Bajpai & B. N. Tiwari Nature 450, 1190-1194 (2007) 


Thewissen etal.’ describe new fossils from India that apparently support 
a phylogeny that places Cetacea (that is, whales, dolphins, porpoises) as 
the sister group to the extinct family Raoellidae, and Hippopotamidae 
as more closely related to pigs and peccaries (that is, Suina) than to 
cetaceans. However, our reanalysis of a modified version of the data set 
they used’ differs in retaining molecular characters and demonstrates 
that Hippopotamidae is the closest extant family to Cetacea and that 
raoellids are the closest extinct group, consistent with previous phylo- 
genetic studies*’. This topology supports the view that the aquatic 
adaptations in hippopotamids and cetaceans are inherited from their 
common ancestor’. 

To conduct our analyses, we started with the same published matrix 
that Thewissen et al. modified’, but included all taxa (but see Appendix 
Fig. 2 where Andrewsarchus is removed from our results), retained the 
molecular partition that was derived from an earlier study’, added the 
three anthracotheres included by Thewissen et al.’, replaced 
‘Raoellidae’ with their character codings for Indohyus and Khirtharia, 
included most of the changes and additions to pakicetids that 
Thewissen et al.' suggested (see Appendix), and then conducted 
cladistic analyses of the entire matrix, with all characters weighted 
equally’ (see Appendix), as well as with homoplastic characters 
down-weighted’. We consider it particularly important to include 
molecular data because they are widely recognized as being instru- 
mental in reconstructing mammalian phylogeny’*. 

In the equal weights and differential weights analyses, Hippopo- 
tamidae is the closest extant taxon to Cetacea, and in those that include 
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Figure 1| Effect of molecular data on the phylogeny of cetaceans and 
terrestrial relatives. a, Simplified version of the single most parsimonious 
tree from the present study when homoplastic characters are down-weighted. 
b, Simplified version of the strict consensus shown in Fig. 2 of Thewissen 
et al.'. The carnivorous to omnivorous mesonychians are in red, terrestrial 


fossils, Raoellidae or the raoellid Indohyus is more closely related to 
Cetacea than is Hippopotamidae (Fig. 1). Hippopotamidae is the 
exclusive sister group to Cetacea plus Raoellidae in the analysis that 
down-weights homoplastic characters, although in the equally weighted 
analysis, another topology was equally parsimonious. In that topology, 
Hippopotamidae moved one node out, being the sister group to an 
Andrewsarchus, Raoellidae and Cetacea clade. In neither analysis is 
Hippopotamidae closer to the pigs and peccaries than to Cetacea, the 
result obtained by Thewissen et al.’. In all our analyses, pachyostosis 
(thickening) of limb bones and bottom walking, which occur in hippo- 
potamids”"®, are interpreted to have evolved before the pachyostosis of 
the auditory bulla, as seen in raoellids and cetaceans’. Pachyostosis of 
the bulla, referred to as the involucrum, is thought to be an adaptation 
for underwater hearing"’. Thus, by looking at the limited pachyostosis 
in hippopotamids, we can speculate that pachyostosis in the ancestors 
of whales initially evolved to counteract buoyancy, but was later co- 
opted for underwater hearing as it spread to the cranium. This and other 
hypotheses like it are tenable using our trees, but not those of Thewissen 
et al.', and underscore the importance of including different classes of 
data in phylogenetic analyses. 
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cetartiodactyls (even-hoofed ungulates) are in green, and aquatic to semi- 
aquatic cetartiodactyls are in blue. Branches are coloured (blue) to show when 
aquatic adaptations, such as bottom walking, evolved. Ina these features may 
have evolved earlier, depending on the behaviour of anthracotheres and 
cebochoerids. Details of the tree structure in a are available in the Appendix. 
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APPENDIX 


Three character codings for pakicetids from Thewissen et al.' were not followed 
here: character 38 was coded as state 1, character 142 also as state 1, and character 
105 as state 2. These character codings are based on J.H.G.’s examination of 
pakicetid specimens currently housed at the Department of Anatomy, 
Northeastern Ohio Universities College of Medicine. The entire matrix used 
in the reanalysis and the tree files are available at http://www.cetacea-evolutio- 
n.org/main/, which is supported by National Science Foundation grant DEB- 
0640361. 

Phylogenetic analyses were conducted with the computer application TNT. 
Two separate analyses were conducted: one with all characters given equal weight 
(Appendix Figs 1 and 2) and another that uses implied weighting (Appendix Fig. 
3), which dynamically down-weights characters that show greater homoplasy 
during the analysis. The degree of down-weighting is set by the constant k, and 
the default value of 3 was used. Both analyses used the default settings for a ‘New 
Technology Search’, which uses sectorial searches and tree fusing. Exceptions to 
the defaults are that the heuristic searches were terminated when the minimum 
length was found 1,000 times and the maximum number of trees saved was set to 
10,000. 
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Appendix Figure 1 | Strict consensus of equal-weights parsimony analysis. 
A total of 1,584 most parsimonious trees of 33,490 steps was found. The tree 
files can be found at http://www.cetacea-evolution.org/main/ and the strict 
consensus of these trees is shown. 
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Appendix Figure 2 | Reduced consensus of equal-weights parsimony 
analysis. Much of the lack of resolution in the strict consensus (Fig. 1) can 
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is pruned from all 1,584 most parsimonious trees, then the reduced 
consensus is as shown. 
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Appendix Figure 3 | Differential weights parsimony analysis. The analysis | shown. This tree is the basis for the simplified tree in the main text and has a 
using implied weights with k = 3 found only one optimal tree, which is fit of 2329.20679. 
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Thewissen et al. reply 


Replying to: J. H. Geisler & J. M. Theodor Nature 458, doi:10.1038/natureO7776 (2009) 


The analysis of Geisler and Theodor' confirms our main phylogenetic 
result’, that raoellids are, or include, the sister group to cetaceans. 
Their study expands on our findings by inferring that hippopotamids 
are the sister group to the combined raoellid—cetacean clade, whereas 
our paper had explicitly stated that our data could not address the 
position of the extant artiodactyl families. 

Geisler and Theodor' place confidence in their results with regard 
to hippopotamids by stating that their analysis is consistent with 
“previous phylogenetic studies”. However, one of the two articles 
that they cite’ was published after the publication of our paper, and 
they do not cite a recent paper* that disagrees with their (and our 
own) results. 

Geisler and Theodor’ execute two analyses, one with equally 
weighted characters and one with down-weighted characters 
(Appendix Fig. 3 of their paper). They state that in both analyses 
Hippopotamidae is the closest extant taxon to Cetacea, but their 
Appendix is inconsistent with this statement. In the Appendix, a large 
polychotomy lies at the base of the raoellid—cetacean clade in the equally 
weighted analysis. This polychotomy dissolves when Andrewsarchus is 
excluded from the study, which results in a strict consensus tree where 
hippopotamids are indeed the sister group to the raoellid—cetacean 
clade. Instead of discussing this consensus tree, Geisler and Theodor’ 
mention that the equally weighted analysis yielded two most parsimo- 
nious trees: one showing hippopotamids as the sister group to the 
raoellid—cetacean clade and the other showing (apparently undeleted) 
Andrewsarchus as more closely related to the raoellid—cetacean clade 
than to hippopotamids. Notably, it took the deletion of Andrewsarchus 
before hippopotamids could be identified as the raoellid—cetacean sister 
group in the unweighted analysis of the Appendix of Geisler and 
Theodor'; the text seems to identify Andrewsarchus as a possible close 
relative of hippopotamids and this is consistent with another recently 
published cladogram*. Andrewsarchus is poorly known, and this may 
result in its unstable phylogenetic position, although it may still be part 
of the hippopotamid-raoellid—cetacean clade. 

Geisler and Theodor’ agree with us that the phylogenetic link 
between cetaceans and raoellids is closer than that between cetaceans 
and hippopotamids, and we consider the former link to be more robust 
than the latter. We also believe that improving fossil collections of 
poorly known taxa is important in advancing understanding of ceta- 
cean relationships. Hippopotamids are certainly closely related to ceta- 
ceans, yet they appear about 35 million years after the first cetaceans, 
and their origins are in Africa, not the birthplace of cetaceans. 


Geisler and Theodor' further elaborate on our results by discussing 
the origin of osteosclerosis of limbs and bulla, calling this process 
pachyostosis (for definitions, see refs 5, 6). Osteosclerosis, but not 
pachyostosis, of the limbs occurs in hippopotamids, raoellids and paki- 
cetids, whereas pachyosteosclerosis of the bulla occurs in raoellids and 
pakicetids, but not in hippopotamids. Experimental studies with 
lipoprotein-receptor-related proteins in transgenic mice suggest that 
pachyostosis and osteosclerosis are underlain by different genetic 
mechanisms’. The mechanisms that lead to such hyperostoses deserve 
further study, and such a study is necessary to test the speculation of 
Geisler and Theodor' that hyperostoses of the limbs and bulla are related. 
Such a study should be rooted in a sound understanding of the develop- 
mental controls of bone formation, a field that is growing rapidly* and 
that promises to elucidate cetacean evolution considerably. 
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ARTICLES 


Activation of CaMKIl in single dendritic 
spines during long-term potentiation 


Seok-Jin R. Lee’, Yasmin Escobedo-Lozoya’, Erzsebet M. Szatmari’ & Ryohei Yasuda’ 


Calcium/calmodulin-dependent kinase II (CaMKII) plays a central part in long-term potentiation (LTP), which underlies some 
forms of learning and memory. Here we monitored the spatiotemporal dynamics of CaMKII activation in individual dendritic 
spines during LTP using two-photon fluorescence lifetime imaging microscopy, in combination with two-photon glutamate 
uncaging. Induction of LTP and associated spine enlargement in single spines triggered transient (~1 min) CaMKIl activation 
restricted to the stimulated spines. CaMKII in spines was specifically activated by NMDA receptors and L-type voltage-sensitive 
calcium channels, presumably by nanodomain Ca?* near the channels, in response to glutamate uncaging and depolarization, 


respectively. The high degree of compartmentalization and channel specificity of CaMKII signalling allow stimuli-specific 
spatiotemporal patterns of CaMKII signalling and may be important for synapse-specificity of synaptic plasticity. 


CaMKIlisa serine/threonine protein kinase consisting of 12 subunits’. 
Each subunit is activated by the association of Ca”*-bound calmodulin 
(Ca?*/CaM). When a CaMKII subunit is autophosphorylated at site 
T286, its activity can be maintained after the dissociation of Ca?*/CaM 
(ref. 2). It has been suggested that CaMKII activation in postsynaptic 
density may persist long term (more than hours) to maintain LTP’. 
CaMKII interacts with several ion channels including the NR2B 
subunit of NMDA receptors (NMDARs)* and voltage-sensitive 
calcium channels (VSCCs)*, suggesting that CaMKII activation may 
be initiated within the nanodomains of these channels to produce 
channel-specific signalling. However, at the level of single synapses it 
is not known whether CaMKII activation is channel-specific, to what 
degree CaMKII activation is compartmentalized, and whether it 
persists in the stimulated spine during LTP. 

To measure CaMKII activation in spines, we developed a fluor- 
escence resonance energy transfer (FRET)-based CaMKIIa sensor by 
modifying the previously reported CaMKII sensor, Camuia, in which 
the amino and carboxy termini of CaMKII« are labelled with donor and 
acceptor fluorophores’. To optimize the sensitivity and brightness, we 
used the FRET pair of monomeric enhanced green fluorescent protein 
(mEGFP)° and resonance energy-accepting chromoprotein (REACh), a 
non-radiative yellow fluorescent protein variant’ (green-Camuia), and 
measured FRET using two-photon fluorescence lifetime imaging 
microscopy*"°. The activation of green-Camuix should change the 
conformation of CaMKII« to the open state in which its kinase domain 
is exposed’, thereby decreasing FRET and increasing the fluorescence 
lifetime of mEGFP (Supplementary Fig. 1). We confirmed that green- 
Camuie. is incorporated into a CaMKII dodecameric holoenzyme, and 
that the fluorescence lifetime of green-Camuix reports CaMKII activa- 
tion associated with T286 phosphorylation as well as Ca7*/CaM bind- 
ing (Supplementary Information). We biolistically'' transfected CA1 
pyramidal neurons in hippocampal cultured slices with green-Camuia 
and mCherry. The expression level of green-Camuic. was estimated to 
be 25-50% of the level of endogenous CaMKIIu subunit, whose con- 
centration was estimated at ~10-100 uM (Supplementary Fig. 2 and 
Supplementary Information). 


CaMKII activation during spine structural plasticity 
Using this technique, we imaged CaMKII activation during structural 
plasticity of dendritic spines, which is considered to be associated with 


LTP'** (Fig. 1). To induce synapse-specific structural plasticity, we 
applied a low frequency train of two-photon glutamate uncaging pulses 
(45 pulses at 0.5 Hz) to a single dendritic spine in zero extracellular 
Mg?* (refs 13, 14). The spine volume increased quickly after glutamate 
uncaging by 376 + 68% (mean + s.e.m.) (Fig. 1c, e) and relaxed to an 
elevated level at 104 + 21% for more than 30 min (Fig. Ic, f)!*'*. The 
structural plasticity was associated with accumulation of green-Camuio 
in the stimulated spines that was roughly proportional to the volume 
change (Supplementary Fig. 3)'°. Structural plasticity of dendritic 
spines was abolished by blocking NMDARs (100M 2-amino-5- 
phosphonopentanoic acid (AP5))'° (Fig. 1c, e, f). A CaMKII inhibitor 
(10 uM KN62) partially inhibited the sustained structural plasticity 
(Fig. Ic, f), but not the transient phase (Fig. lc, e), suggesting that the 
activation of CaMKII is required for the maintenance of the spine 
volume change’*. Overexpression of the T286A mutant of green- 
Camuio reduced the sustained spine enlargement (Fig. 1c, fand Supple- 
mentary Fig. 4), demonstrating that this mutant acts as dominant 
negative. Because the overexpression level of the T286A mutant is 
relatively small (10-20%; see Supplementary Information), inhibiting 
autophosphorylation of T286 in a relatively small fraction of CaMKII 
subunits in a holoenzyme may be sufficient for inhibiting structural 
plasticity. This result is consistent with previous studies reporting that 
LTP is attenuated in T286A heterozygous knockout mice'®. 

During structural plasticity, the activity of CaMKII, as measured by 
the fluorescence lifetime of green-Camuia, increased rapidly (Fig. la, 
b). The activity was restricted to the stimulated spines, and did not 
spread into dendrites or adjacent spines (Fig. la, b, d). Activation of 
CaMKII preceded the enlargement of spine volume, suggesting that 
CaMKII triggers molecular processes that lead to the structural plas- 
ticity (Fig. 1a, b, c). The NMDAR blocker AP5 inhibited the fluor- 
escence lifetime change, indicating that CaMKII is activated by Ca** 
from NMDARs (Fig. 1b, d). The CaMKII inhibitor KN62 decreased the 
fluorescence lifetime change by 40% (Fig. 1b, d). The T286A and T305D 
mutant sensors reported less activation, suggesting that the binding of 
calmodulin and the phosphorylation of T286 is required for the full 
activation of CaMKII in spines (Fig. 1b, d and Supplementary Fig. 4). 


Mechanisms of spine-specific CaMKII activation 
In general, the degree of compartmentalization of CaMKII activation 
is determined by two factors: the inactivation time constant of 
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Figure 1| Simultaneous measurements of CaMKII activation and structural 
plasticity in single spines using two-photon fluorescence lifetime imaging 
microscopy combined with two-photon glutamate uncaging. 

a, Fluorescence lifetime images of green-Camuix during the induction of 
spine structural plasticity by two-photon glutamate uncaging in the absence 
of extracellular Mg”" . Longer lifetimes indicate increased activity. The white 
arrowhead indicates the location of uncaging laser spot. b, Averaged time 
course of fluorescence lifetime change of green-Camuix in the stimulated 
(Stim) spine and adjacent (Adj) spines (within 5 jtm of stimulated spine). 
Data using pharmacological inhibitors and T286A mutant green-Camuio 
are also shown. The number of samples (spines/neurons) is 35/30 for 
stimulated spines, 29/28 for adjacent spines, 5/4 for AP5, 7/5 for KN62 and 
18/11 T286A. Right panel shows expanded view of fluorescence lifetime 
change during uncaging. c, Averaged time course of spine volume change in 
stimulated spines and adjacent spines. Right panel shows expanded view of 
the volume change. d, CaMKII activation (averaged over 0-40 s or first five 
points). Asterisks denote statistical significance between control and others 
(P< 0.05). e, Transient volume change (average volume change over 
1.5—2 min subtracted by that averaged over 25-30 min)"’. f, Sustained 
volume change (volume change averaged over 25-30 min). Error bars are 
s.e.m. 


CaMKIU activity, and the mobility of CaMKII (refs 10, 17). Thus, we 
measured these parameters to study the mechanisms by which 
CaMKI activation is restricted. To measure the kinetics of CaMKII 
precisely, we imaged CaMKII activity with higher temporal resolu- 
tion (2s) in response to a brief train of glutamate uncaging pulses 
(0.5 Hz, eight pulses in zero extracellular Mg”*). The activation of 
CaMKII reached a plateau within 6 s, and after the cessation of 
uncaging, it was inactivated with two time constants, 6 and 45s 
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Figure 2 | Measurements of parameters that determine spine-specificity: 
inactivation kinetics and mobility of CaMKII. a, Measurements of CaMKII 
activation and inactivation after brief stimulation (eight pulses, 0.5 Hz) of 
spines in zero extracellular Mg”*. The solid cyan curve indicates double 
exponential functions obtained by fitting to wild-type data (time constants: 
6s (80%) and 45s (20%)). The dashed cyan curve indicates single 
exponential function with 1 s time constant. The number of samples (spines/ 
neurons) is 25/5 for wild type, 28/4 for T286A. b, CaMKII inactivation 
kinetics after different number of uncaging pulses (1, 8 and 45 pulses, 

0.5 Hz). Data were fitted with double exponential functions (cyan curves). 
The time constants were 9.3 s (71%) and >100s (29%) for 1 pulse, 4.7 s 
(61%) and 136.6 s (39%) for 8 pulses, and 4.5 s (61%) and 102 s (39%) for 45 
pulses. The number of samples (spines/neurons) is 28/9 (1 pulses), 16/8 (8 
pulses) and 13/7 (45 pulses). c, Measurements of spine—dendrite coupling of 
CaMKII using paGFP. The top panel shows green and red fluorescence 
before and after photoactivation in a neuron expressing REACh-CaMKIIa- 
paGFP (green) and mCherry (red). Scale bars, 1 tm. The bottom panel 
shows the averaged time course of the red/green ratio. Signal was normalized 
to the peak fluorescence change. The decay time constants were 1.3 min 
(48%) and 17.0 min (52%) for REACh-CaMKII«-paGFP (solid cyan curve), 
and 1.0 min (53%) and 20.7 min (47%) for paGFP-CaMKIIa (dashed cyan 
curve). The number of samples (spines/neurons) is 15/5 (REACh-CaMKIIa- 
paGFP) and 22/9 (paGFP-CaMKII«). d, Spine-dendrite coupling of green- 
Camuic during structural plasticity. During glutamate uncaging (720 nm, 
6 ms, 45 pulses in Mg”* free solution), REACh-CaMKIIx-paGEP can be 
photoactivated. We subtracted the normalized red from the normalized 
green fluorescence, assuming that the small baseline fluorescence of REACh- 
CaMKIIu-paGFP behaves similarly to the mCherry fluorescence change (see 
Supplementary Fig. 3). The decay time constants were obtained as 1.6 (82%) 
and 15.3 (18%) min (cyan curve). The number of samples (spines/neurons) 
is 10/7. All error bars are s.e.m. 


(Fig. 2a). T286A green-Camuia was activated much less than wild 
type, and decayed within 2 s (Fig. 2a). These results indicate that T286 
phosphorylation prolongs CaMKII activation and allows the accu- 
mulation of activated CaMKII during repetitive glutamate uncaging. 
To measure how the decay kinetics are regulated by CaMKII activa- 
tion, we stimulated spines with different numbers of uncaging pulses 
from 1 to 45 (Fig. 2b). With 45 uncaging pulses, we observed enlarge- 
ments of spines. Our data indicate that the decay time constants 
depend on neither the number of stimulations nor the level of the 
CaMKII activation (Fig. 2b). 

Next, we measured the mobility of CaMKII, another determinant of 
the compartmentalization of CaMKII, using photoactivatable GFP 
(paGFP; Fig. 2c, d). We used either a green-Camuia variant with 
EGFP replaced by photoactivatable GFP (REACh-CaMKII-paGFP) 
or paGFP-tagged CaMKII (paGFP-CaMKII«). After photoactivation 
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of paGFP, green fluorescence decayed owing to the diffusion of 
CaMKIla out of the spine with two time constants, ~1 and 
~20min (Fig. 2c). Fluorescence recovery after photobleaching 
(FRAP) of green-Camuic. also showed similar recovery time constants 
(Supplementary Fig. 5b, c). We also measured the mobility of CaMKII 
during structural plasticity by monitoring the decay of REACh- 
CaMKIlo-paGFP which is photoactivated by glutamate uncaging 
protocol (6ms pulse, 45 times). The photoactivated REACh- 
CaMKIla-paGFP decayed with the time constants similar to the resting 
condition (Fig. 2d). Consistent with this spine—dendrite coupling time 
constant, accumulation of CaMKIIa into spines during the transient 
phase of structural plasticity was smaller than the spine volume change 
(Supplementary Fig. 3). Because the inactivation of CaMKII is faster 
than its diffusion out of the spine, CaMKII activation must be 
restricted to the stimulated spine, which agrees with our measurements 
of CaMKII activity during structural plasticity (Fig. 1). 


CaMKII activation during pairing-induced LTP 


In the presence of extracellular Mg”*, LTP induction in hippocampal 
CAI synapses requires coincident postsynaptic depolarization and 
synaptic activation to release Mg”* block of NMDARs'*. To study the 
dynamics of CaMKII activation under such conditions, we imaged 
CaMKII activation during LTP induced by pairing postsynaptic 
depolarization with two-photon glutamate uncaging in the presence 
of extracellular Mg?" (1 mM)” (Fig. 3). We performed whole-cell 
patch clamp recordings on neurons transfected with green-Camuix 
and mCherry, and measured the postsynaptic current evoked by two- 
photon glutamate uncaging (uncaging-evoked EPSC) at —65 mV. 
LTP was induced by pairing uncaging pulses (30 pulses at 0.5 Hz) 
with postsynaptic depolarization (0 mV). This LTP protocol induced 
a sustained increase in both uncaging-evoked EPSC and spine 
volume by ~150%, which was maintained for more than 60 min 
(Fig. 3c, d). In the same LTP protocol, depolarization produced large 
calcium increases in spines (2.6 + 0.5 uM) and in dendritic shafts 
(2.6£0.5uM), presumably due to the activation of VSCCs'® 
(Supplementary Fig. 6a—c). Although the depolarization was held 
for more than 1 min, [Ca**] decayed rapidly within 2 s”° (Supple- 
mentary Fig. 6b, c). On top of small residual Ca”* (0.21 + 0.07 uM), 
subsequent glutamate uncaging produced calcium transients 
between 1.8+0.3uM (first uncaging) and 0.8+0.2uM (30th 
uncaging), largely restricted to the heads of the stimulated spines*!”* 
(Supplementary Fig. 6b, c). 

In response to depolarization, CaMKII activity increased in dend- 
ritic shafts (lifetime change = 0.079 + 0.018 ns), but less so in spines 
(0.028 + 0.010 ns) (Fig. 3a, b). Glutamate uncaging pulses caused 
further CaMKII activation in the stimulated spines (0.097 + 0.014 ns), 
but not in the adjacent spines (Fig. 3a, b). CaMKII activation in spines 
and dendritic shafts rapidly decayed within ~2 min. The large gradient 
of CaMKII activation between spines and dendritic shafts during LTP 
induction demonstrates the high degree of compartmentalization of 
CaMKII activation in individual spines (Fig. 3a, b). 


Channel-specific activation of CaMKII 


It has been reported that Ca?* through VSCCs is required for LTP 
induced by several protocols'®”***. Therefore, we further characterized 
the depolarization-induced CaMKII activation with higher temporal 
resolution (2 s; Fig. 4). We found that CaMKII activation in dendritic 
shafts increased within 2s after depolarization, was sustained during 
the depolarization (16s), and then decayed rapidly after repolarization 
(Fig. 4a). Activation of CaMKII in spines (0.034 + 0.004 ns) was smaller 
than in dendrites (0.076 + 0.007 ns). [Ca’~] during the depolarization 
decayed within 2's, and little extra Ca** remained for more than 10s 
(0.21 + 0.07 1M) (Supplementary Fig. 6b). Thus, these results indicate 
that small sustained increases in [Ca**] are sufficient to sustain 
CaMKII activation during the depolarization for more than ~10s. 
The T286A mutant was inactivated much more rapidly (in less than 
4s) than the wild type during the depolarization (Fig. 4b), indicating 
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Figure 3 | Imaging CaMKII activation during LTP induction by two-photon 
glutamate uncaging paired with postsynaptic depolarization. 

a, Fluorescence lifetime images of green-Camuix during the induction of 
LTP by two-photon glutamate uncaging paired with postsynaptic 
depolarization. b, Averaged time course of fluorescence lifetime change of 
green-Camuiz in the stimulated spine, adjacent spines (within 5 um of 
stimulated spine), and dendritic shaft close to the stimulated spine. Right 
panel shows extended view of fluorescence lifetime change during uncaging. 
Results are the average of 8 stimulated spines, 13 adjacent spines and 8 
dendrites from 8 neurons. The plot of stimulated spines is noisier due to the 
smaller number of samples. Plots of stimulated spines in b—d are from the 
same spines. c, Averaged time course of changes in spine volume and the 
amplitude of uncaging-evoked EPSC (uEPSC) at —65 mV in the stimulated 
(Stim) spine and adjacent (Adj) spines (within 5 um). For uncaging-evoked 
EPSC, each point is the average of four trials at 5-s intervals. The uncaging 
spot was relocated to the tip of spines before each of the four trial 
measurements. d, Uncaging-evoked EPSC at single spines before and 60 min 
after LTP induction (average of four trials, filtered with 2-ms window). All 
error bars are s.e.m. 


that the prolonged activation of CaMKII requires autophosphorylation 
at T286, and the decay of CaMKII activity after repolarization is due to 
the dephosphorylation of T286. Bursts of back-propagating action 
potentials (83 Hz, 0.5 s, eight times, 2 s interval) also produced similar 
CaMKII activation in spines and dendrites (Fig. 4d). 

Nimodipine, a blocker of L-type VSCCs, abolished CaMKII activa- 
tion in dendritic spines during depolarization (Fig. 4c). The peak of 
CaMKII activation in dendritic shafts was not affected by the blockade 
of L-type VSCCs, but the activity decayed more rapidly (Fig. 4c). 
Because L-type VSCCs produce only a small fraction of Ca** in spines 
and dendritic shafts during cell depolarization (Supplementary 
Fig. 6d), L-type VSCC-dependent CaMKII activation presumably 
occurs in the nanodomain at the mouth of L-type VSCCs*”””*. Also, 
global Ca*~ increase by opening of non-L-type VSCCs did not activate 
CaMKII in spines (Fig. 4c), although it is as large as uncaging-evoked 
Ca** increase (Supplementary Fig. 6). This result indicates that global 
Ca’* increase alone is insufficient to activate CaMKII in spines. 
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Figure 4 | Differential activation of CaMKII in spines and dendrites by 
postsynaptic depolarization. a, High temporal resolution measurements of the 
fluorescence lifetime change of green-Camuic in spines and dendrites in 
response to postsynaptic depolarization (0 mV, 16s, indicated by the bar) in 
spines (red) and dendritic shafts (black). Results are the average of 53 spines and 
30 dendrites from 12 neurons. The cyan lines are a double exponential function 
obtained by fitting. The time constants are 11.3 s (84%) and >300 s (16%) for 
spines, and 6.8 s (60%) and 43.7 s (40%) for dendrites. b, Fluorescence lifetime 
change of T286A mutant of green-Camuic. Results are the average of 19 spines 
and 14 dendrites from 8 neurons. c, Fluorescence lifetime change of green- 
Camuiz in the presence of L-type VSCC inhibitor (20 LM Nimodipine). Results 
are the average of 22 spines and 15 dendrites from 7 neurons. d, Bursts of back- 
propagating action potentials (BAP)-induced CaMKII activation at proximal 
dendrites (<100 um). We applied eight back-propagating action potential 
bursts, each 83 Hz for 0.5 s, at 2-s intervals. Results are the average of 22 spines 
and 12 dendrites from 7 neurons. All error bars are s.e.m. 
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To test further the hypothesis of the nanodomain activation of 
CaMKII, we performed whole-cell patch clamping using a pipette 
including the Ca”* chelators EGTA or BAPTA. BAPTA and EGTA 
have similar dissociation constants to Ca**, but the binding rate of 
BAPTA is ~100 times higher”. Thus, EGTA inhibits global Ca7* over 
nanodomain Ca** more selectively than BAPTA does”. The activa- 
tion of CaMKII induced by depolarization was inhibited by BAPTA 
much more effectively compared to the same concentration of EGTA 
(Fig. 5a—f), suggesting that nanodomain Ca’~ near calcium channels 
is sufficient to activate CaMKII both in spines and in dendrites. With 
EGTA loaded in neurons, depolarization induced larger CaMKII 
activation in spines, suggesting that there is global Ca**-dependent 
CaMKIU inactivation (Fig. 5b, c). Slower CaMKII activation kinetics in 
the presence of EGTA (Fig. 5b, c) are presumably caused by smaller 
Ca’ *-dependent inactivation combined with smaller activation per 
unit time due to reduced Ca** increase. Unlike depolarization- 
induced CaMKII activation, EGTA and BAPTA both similarly 
reduced glutamate-uncaging-induced CaMKII activation (Fig. 5g-i), 
suggesting that nanodomain Ca’~ alone is not sufficient for uncaging- 
induced CaMKII activation. 


Discussion 


In this study, we visualized CaMKII activation in single dendritic spines 
during LTP. Using this method, we demonstrated that CaMKII activa- 
tion is largely restricted to the stimulated spine during LTP induced by 
two-photon glutamate uncaging (Figs 1 and 3). This suggests that 
CaMKII activation is important for the synapse specificity of LTP. 
This finding is in contrast to the activation of Ras, which is another 
molecule essential for LTP induction’. In response to LTP-inducing 
stimuli, Ras is activated in stimulated spines, and then activated Ras 
spreads over ~ 10 um of dendritic length and invades nearby spines by 
diffusion’®. The restricted CaMKII activation can be explained by the 
rapid CaMKII inactivation in spines and dendrites (~5s and ~1 min, 
Figs 2a, b and 4a) and slow spine-dendrite diffusion coupling (~1 and 
~20 min, Fig. 2c, d)*': activated CaMKII at the stimulated spine gets 
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Figure 5 | Effects of Ca2* chelators on CaMKII activation. 
a-e, Fluorescence lifetime change of green-Camuix in spines and dendrites 
in response to postsynaptic depolarization (0 mV, 0-16 s, indicated by the 
black bar) in spines (red) and dendritic shafts (black) in neurons patch- 
clamped with electrodes containing no Ca”* chelator (a), 5mM EGTA 
(b), 20 mM EGTA (c), 5mM BAPTA (d) or 20 mM BAPTA (e). The number 
of samples (spines/dendrites/neurons) is 18/11/6 for control (a), 29/14/7 for 
5mM EGTA (b), 15/7/4 for 20 mM EGTA (c), 21/7/5 for 5mM BAPTA 
(d) and 18/7/4 for 20 mM BAPTA (e). Imaging was performed more than 
15 min after establishing a whole-cell patch. f, Dependence of fluorescence 
lifetime change (averaged over 0-16 s) by depolarization on Ca*" chelator 
concentration (EGTA: closed circle, BAPTA: open circle, red: spine, black: 
dendrite). Fluorescence lifetime was normalized to control. g, h, Fluorescence 
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lifetime change of green-Camuiz in response to glutamate uncaging (6 ms, 
45 stimuli in zero extracellular Mg”*) in neurons patch-clamped with 
electrodes containing 1-20 mM EGTA (g) or 1-5 mM BAPTA (h) ina 
current-clamp mode. Controls are measured in the same neurons before 
patch-clamping. Patch-clamping with internal solution without Ca?* 
chelator did not alter the fluorescence lifetime change (Supplementary Fig. 
7). The number of samples (spines/neurons) is 22/9 for control in g, 12/3 for 
1mM EGTA, 15/3 for 5mM EGTA, 17/3 for 20 mM EGTA, 16/7 for control 
in h, 17/4 for 1mM BAPTA and 14/3 for 5mM BAPTA. i, Dependence of 
fluorescence lifetime change (averaged over 0-40 s) induced by glutamate 
uncaging on Ca”* chelator concentration (EGTA: open circle, BAPTA: 
closed circle). Fluorescence lifetime was normalized to control. Error bars 
represent s.e.m. 
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inactivated before it diffuses out of the spine. In contrast, Ras inactiva- 
tion is much slower (~5 min) and the spine—dendrite coupling time 
constant is smaller (~5s)'°. This allows activated Ras to spread and 
invade nearby spines before it gets inactivated. Because CaMKII and 
Ras act in parallel pathways to induce LTP", the spine specificity of 
LTP may be produced by CaMKII, whereas Ras may be permissive for 
LTP and responsible for dendritic level signalling such as the regulation 
of threshold for LTP induction’. 

Rapid inactivation of CaMKII during LTP induction (Figs 1 and 3) 
is consistent with previous studies showing that persistent activation 
of CaMKII is not required for LTP maintenance****. However, this 
seems to be inconsistent with biochemical studies demonstrating that 
T286 phosphorylation is sustained after LTP induction**”». This dis- 
crepancy may be due to the difference in sample preparation (acute 
slices versus cultured slices), in the population of CaMKII measured 
(whole lysate including soma versus single synapses), or in stimu- 
lation models (most of the previous biochemical studies used strong 
stimulations to induce LTP in a large number of synapses, whereas we 
induced LTP in a single synapse). Alternatively, only the subpopula- 
tion of CaMKII subunits that bind to NR2B subunit of NMDARs 
may maintain their activity***. Given that there is more CaMKII than 
NMDARs in the postsynaptic density”, the subpopulation of 
CaMKII that undergoes persistent activation may be below the detec- 
tion threshold of our method. 

We found that T286 phosphorylation is required for high CaMKII 
activation in response to repetitive two-photon uncaging (Figs 1b 
and 2a). [Ca**] transients last only ~0.2s after each uncaging 
(Supplementary Fig. 6a—c), owing to strong Ca*~ extrusion in spines 
and dendrites’. Thus, Ca”*/CaM should dissociate from CaMKII 
rapidly after these stimulations as indicated in the activation time 
course of T286A green-Camuie (Figs 2a and 4b). The phosphoryla- 
tion of T286 prolongs the activation of CaMKII by ~5 s (Figs 2a and 
4a), and allows for the accumulation of activated CaMKII during 
repetitive uncaging (Fig. 2a). 

Our imaging results indicate that CaMKII activation in dendritic 
spines and dendritic shafts are regulated by different Ca** sources in 
response to cell depolarization: CaMKII in dendritic spines is activated 
specifically by L-type VSCCs, whereas CaMKII in dendritic shafts is 
activated mostly by non-L-type channels in addition to a small con- 
tribution from L-type VSCCs (Fig. 4). Because L-type VSCCs do not 
contribute to the global Ca** in spines and dendritic shafts 
(Supplementary Fig. 6d), the L-type-VSCC-dependent activation of 
CaMKII must be produced in the nanodomain at the mouth of 
L-type VSCCs*?”"8, Experiments using Ca”* chelators with different 
binding rates (Fig. 5a-f) also suggest that nanodomain Ca’~ at calcium 
channels is sufficient for CaMKII activation in response to cell 
depolarization, both in spines and dendrites. Because L-type VSCCs 
are required for hippocampal LTP induced by several protocols'””*°, 
CaMKII activation by L-type VSCCs, although smaller than that by 
NMDARs (Figs 3 and 4), is probably important for the modulation of 
LTP. 

Unlike CaMKII activation in response to depolarization, nanodo- 
main Ca*™ alone was insufficient for NNDAR-mediated CaMKII 
activation (Fig. 5g~i). Because global Ca”* increase through non-L- 
type VSCCs was not sufficient to activate CaMKII in spines (Fig. 4c) 
(although it is as high as and longer-lasting than the uncaging-evoked 
[Ca’*] increase; Supplementary Fig. 6d), NNDAR-mediated CaMKII 
activation presumably requires both global and nanodomain Ca’~. 
Thus, CaMKII activation mechanisms depend on the channels from 
which Ca’* influx occurs. The difference may be due to different CaM 
localization near channels. Also, the dynamics of Ca**-dependent 
protein phosphatase activity are probably important in shaping the 
spatiotemporal dynamics of CaMKII (Fig. 5a—f). 

Most LTP induction protocols involve postsynaptic depolariza- 
tion'®, which alone can produce large Ca** in spines (Supple- 
mentary Fig. 6)'°. However, it had remained a mystery as to why 
postsynaptic depolarization alone cannot induce synapse nonspecific 
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LTP'*”. Our results demonstrate that depolarization activates only a 
subset of CaMKII that interact with L-type VSCCs, which presumably 
is insufficient for LTP induction**. Thus, local Ca?*-mediated, 
channel-specific CaMKII activation in spines has an important role 
in the induction of synapse-specific LTP. 


METHODS SUMMARY 


Green-Camuix was made from Camuia’ by replacing ECFP with mEGFP and 
Venus with REACh (Venus(A206K/T145W)”*?). Hippocampal cultured slices 
were prepared from postnatal day 6-8 rats as described*’. Neurons were sparsely 
transfected with green-Camuiax and mCherry using ballistic gene transfer’ a 
days 8-14 in vitro, and imaged 2-5 days after transfection. CaMKII activity was 
measured using two-photon fluorescence lifetime imaging microscopy and 
spine volume change was monitored by measuring the fluorescence intensity 
of mCherry (red) in spines using regular two-photon microscopy”””. Statistica 
tests were performed using analysis of variance (ANOVA) followed by the leas 
significant difference post-hoc tests. All error bars indicate s.e.m. 


Full Methods and any associated references are available in the online version o 
the paper at www.nature.com/nature. 
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METHODS 

Constructs. Camuia (Venus-CaMKIIo—CFP) and short-hairpin RNA (shRNA) 
against CaMKII was provided by Y. Hayashi’. Green-Camuia was made by 
exchanging Venus with REACh (Venus*! with monomeric mutation, A206K 
(ref. 6), and dark mutation, T145W (refs 7, 39, 42)) and CFP with mEGFP 
(EGFP(A206K))*’. 

Preparation. Hippocampal slice cultures were prepared from postnatal day 6 or 
7 rats, as described*®. After 9-14 days in culture, cells were transfected with 
ballistic gene transfer'' using gold beads (9-11 mg) coated with plasmids con- 
taining cDNA of green-Camui (35 pg) and mCherry (16 jig). We used one neu- 
ron per slice for most of our experiments. Samples were prepared in accordance 
with the animal care and use guidelines of Duke University Medical Center. 
Imaging. Details of FRET imaging using two-photon fluorescence lifetime 
imaging microscopy have been described previously’. Green-Camuio, and 
mCherry were simultaneously excited with a Ti:Sapphire laser tuned at 900 or 
920 nm”. Both epi- and trans-fluorescence were used for the detection. The 
fluorescence was divided with a dichroic mirror (565nm) and detected with 
photoelectron multiplier tubes (PMTs) placed after wavelength filters 
(Chroma, 510nm/70nm (centre wavelength/bandwidth) for green and 
620 nm/90 nm for red). For fluorescence lifetime imaging in the green channel, 
PMT with low transit time spread (Hamamatsu) placed at epi-fluorescence site 
was used. For intensity imaging, wide aperture PMTs were used (Hamamatsu). 
Fluorescence lifetime image was produced on a PCI board (Becker-Hickl) con- 
trolled with custom software written using Matlab 7.0. 

Fluorescence lifetime image analysis. Fluorescence lifetime of green-Camuia 
should have at least three components including open conformation, close con- 
formation, and donor with unfolded acceptor’. Thus, it is difficult to make a 
complete model and perform curve fitting. Instead, the mean fluorescence life- 
time averaged over several populations t,,, was measured from the mean photon 
arrival time < t> as follows’: 


f dt-tF(t) 
Jf dt-F(t) 


Tm = (t) — to to 
in which F(t) is the fluorescence lifetime decay curve (Supplementary Fig. 1a), tis 
time, and { is the offset. We estimated the offset f) before each experiment by 
fitting fluorescence lifetime decay curve of green-Camui« expressed in a neuron 
with double exponential function and comparing < t> with the fluorescence 
lifetime averaged over two populations as follows: 


J dt-tF(t) J dt-tF(t) 


t = Pitt + Pyth 
oO de R) "Pde F(t) 


Pit, + Pyt, 


in which P, and P, are the fraction and 7, and 72 are the fluorescence lifetime of 
the two populations obtained by fitting. Change in fluorescence lifetime is inde- 
pendent of to, therefore it depends on neither model nor curve fitting. 

The fluorescence lifetime of green-Camui«. in spines under resting condition 

was 1.82 + 0.02 ns for wild type (samples in Fig. 1 and Supplementary Fig. 4 
pooled together), 1.84 + 0.02 ns for T286A (Fig. 1 and Supplementary Fig. 4) and 
1.81 + 0.03 ns for T305D (Supplementary Fig. 4). The fluorescence lifetime 
under resting condition in the presence of pharmacological drugs was 
1.77 + 0.06 ns for AP5 (Fig. 1), 1.67 + 0.03 ns for KN62 (Fig. 1), 1.81 + 0.02 ns 
for Nimodipine (Fig. 4), 1.80+0.01ns for 1-20mM EGTA (Fig. 5), and 
1.83 + 0.02 ns for 1-20 mM BAPTA (Fig. 5). Thus, most of the drugs used in 
this study did not change the fluorescence lifetime, except that KN62 decreased 
the fluorescence lifetime significantly (P< 0.05, comparison between control 
and KN62 for samples in Fig. 1). 
Two-photon glutamate uncaging. A laser tuned at 720 nm was used to uncage 4- 
methoxy-7-nitroindolinyl (MNI)-caged L-glutamate (2 mM) in extracellular solu- 
tion. The same galvano-scanning mirrors were used to steer both imaging and 
uncaging lasers. The laser spot was steered to the tip of a spine image 
(Supplementary Fig. 8), and 4-6 ms pulses at 4mW were applied to stimulate 
glutamate receptors on the spine. This produced a 2-20 pA postsynaptic current 
(Fig. 3d, Supplementary Fig. 8). The resolution of glutamate uncaging was ~1 jm 
(Supplementary Fig. 8). The intensity of the laser was modulated by a pockels cell 
(Con-Optics). For LTP experiment (Fig. 3), the uncaging spot was relocated before 
each uncaging session (each session includes four trials of glutamate uncaging at 
0.2 Hz). The session was repeated every 10 min (Fig. 3c). Series and input resistance 
were continuously monitored, and if they changed more than 20%, the measure- 
ments were rejected from further analysis. To induce structural plasticity of spines 
(Figs 1, 2a, b, d and 5g-i), we used ACSF (130mM NaCl, 2.5mM KCl, 2mM 
NaHCO, 1.25mM NaH,PO, and 25mM glucose) containing 4mM CaCl, 
1M tetrodotoxin (TTX) and 2mM MNI-caged L-glutamate at 25-27 °C. To 
induce LTP with the pairing protocol (Fig. 3), we added 1 mM MgCh. 
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FRAP (and photoactivation). For FRAP (Supplementary Fig. 4b, c), spine fluor- 
escence was photobleached by parking the imaging laser (tuned at 920 nm) for 
1-85 at a spine of interest. To activate paGFP (Fig. 2c, d), the uncaging laser 
(tuned at 800 nm) was parked for 20 ms at a spine of interest. 
Electrophysiology. Whole-cell patch clamping was performed with patch pipettes 
(4-7 MQ). We used cells with series resistance lower than 25 MQ for experiments. 
To induce LTP with the pairing protocol (Fig. 3) or to induce cell depolarization 
(Figs 4a—d and 5a-f), we performed experiments in a voltage-clamp mode using 
Cs* internal solution (130 mM CsMeSO3, 10 mM sodium phosphocreatine, 4 mM 
MgCh, 4mM Na,ATP, 0.4mM Na,GTP, 10 mM Cs-HEPES, pH 7.3). To induce 
back-propagating action potentials (Fig. 4d), we injected 4-nA, 4-ms current 
pulses in a current-clamp mode using K* based internal solution (Cs* was sub- 
stituted with K*) and ACSE with 4mM CaCl,, 4mM MgCh, 1 uM TTX, 100 pM 
AP5 and 10uM_ 2,3-dioxo-6-nitro-1,2,3,4-tetrahydrobenzo|f] quinoxaline-7- 
sulphonamide (NBQX). For experiments using BAPTA or EGTA (Fig. 5), we 
started imaging ~ 15 min after establishing the whole-cell patch clamp with pipette 
including BAPTA or EGTA. Depolarization (Fig. 5a-f) and glutamate uncaging 
(Fig. 5g—i) were performed in voltage and current-clamp modes, respectively. To 
prepare internal solution with 1, 5 and 20 mM EGTA or BAPTA (Fig. 5), we 
prepared 100mM K-EGTA and K-BAPTA (pH7.3), and mixed 1%, 5% and 
20% volume of this solution and 99%, 95% and 80% volume of Cs" - (Fig. 5a—f) 
or K*- (Fig. 5g-i) based internal solution, respectively. Then, we re-adjusted the 
osmolarity to 290-300 and the pH to 7.3. 

Calcium imaging. Ca** imaging (Supplementary Fig. 6) was performed as 
described previously*’. We performed whole-cell patch clamping to a neuron 
in a cultured hippocampal slice using an electrode containing Ca**-sensitive 
green dye (500 UM Fluo-4FF) and Ca’ * -insensitive red dye (300 uM Alexa-594) 
in Cs* internal solution. Images were acquired every 32 ms (Supplementary 
Fig. 6a—c) or 256ms (Supplementary Fig. 6d). The ratio of the fluorescence 
change of green normalized to red signal averaged over an imaging session 
(AG/R) was used as a measure of the [Ca”*] transient. Both fluorophores were 
simultaneously excited with a Ti:Sapphire laser tuned at 920 nm. The change in 
[Ca**] (A[Ca?*]) was measured as: 


_ (AGp) Kp 
(CR dat 


in which (G/R).at is the ratio of green and red signals at saturating Ca** (10mM) 
measured in a pipette, and the dissociation constant (K4) is 10.4 uM for Fluo-4FF”. 
Measurements of the concentration of green-Camuia in cells. To measure the 
concentration of mEGFP—CaMKIl« or green-Camui in neurons, we measured 
the fluorescence intensity of mEGFP at thick dendrites of neurons under two- 
photon microscopy relative to the brightness of purified mEGFP (1 1M) 
measured under the same imaging setup. When measuring objects larger than 
two-photon excitation volume (~0.9 femtolitre), the brightness should be pro- 
portional to the concentration of fluorophores. We overexpressed mEGFP tagged 
with poly-histidine tag (Hiss) (pRSET-Hiss-mEGEP) in Escherichia coli and puri- 
fied them using a Nickel-NTI affinity column. The concentration of the purified 
protein was measured by the absorbance of MEGFP (Aygo = 56,000 cm ~ 7 1)#, 
For green-Camuia, because FRET reduces its green fluorescence, we corrected for 
this effect by multiplying TmEGEP! Tgreen-Camuio ~1.4, in which TmEGEP and 
Tgreen-Camuia, are the fluorescence lifetime of mEGFP (2.6 ns) and green-Camui 
(1.8 ns), respectively. 

In vitro fluorescence lifetime assay. To measure the fluorescence lifetime of 
green-Camuis in cell lysates (Supplementary Fig. 1), we transfected HEK-293 
cells with green-Camuia, using Lipofectamine 2000. Cells were lysed with a 
solution containing 40 mM HEPES (pH 8.0), 0.1 mM EGTA, 5mM magnesium 
acetate, 0.01% Tween-20, and concentrated with a centrifugal filter (Microcon, 
Millipore) to remove CaM and ATP. The lysate was diluted 15 times in 136.5 mM 
potassium gluconate, 17.5mM KCl, 9mM NaCl, 1mM MgCh, 0.2mM EGTA 
and 10mM HEPES-KOH, pH7.2. We measured the fluorescence lifetime in 
response to increased free [Ca”*] by adding 0.5mM CaCl, and then reduced 
free [Ca**] by adding 2 mM EGTA. 

Immunofluorescence imaging. Slices were transfected with green-Camuic (or 
mEGFP-CaMKIla) and empty vector, which substitutes mCherry. In parallel 
experiments shRNA against CaMKIIo (35 jig) and mEGFP (16 jig) were used to 
verify the specificity of immunostaining. Slices were fixed in 4% paraformalde- 
hyde plus 4% sucrose (in 0.1 M phosphate buffer) for 15 min, followed by incuba- 
tion in 30% sucrose for 15 min at room temperature. Slices were then re-sectioned 
using a vibratome at a thickness of 25 jum. The sections were permeabilized and 
blocked overnight at 4°C in 1% Triton X-100 plus 5% normal goat serum, 
followed by 30 min blocking in 10% goat serum (in 0.1 M phosphate buffer). 
Immunostaining was performed for 24 h at 4 °C using as primary antibody mouse 
anti-CaMKIIo (GeneTex Inc.) diluted 1:400 in 5% goat serum in PBS-T (PBS plus 
0.1% Triton X-100). After three washes (20 min each) in PBS-T, sections were 
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incubated with a Texas-red conjugated goat anti-mouse secondary antibody 
(Invitrogen/Molecular Probes) diluted 1:400 in 10% normal goat serum in 
PBS-T. The immunofluorescence was quantified under two-photon microscopy 
in the cell body excluding the nucleus (Supplementary Fig. 2). The value was 
subtracted by the dark noise of photomultiplier. There was a few per cent of 
bleedthrough of mEGFP fluorescence to the red channel. To correct this effect, 
we measured the ratio (r) between the red and green fluorescence of mEGFP 
fluorescence using a purified mEGFP. Then, the bleedthrough was corrected as: 
Trexas-red = Tred — "greeny in which Ipeq and Fyexas-rea ate the immunofluorescence 
before and after the correction, and Igreen is MEGFP fluorescence in the green 
channel. 

Immunoprecipitation. HEK-293 cells were transfected with green-Camuia and/ 
or CaMKII using Lipofectamine 2000. Cells were lysed in a buffer containing 
20mM _ Tris-HCl, pH7.4, 137mM NaCl, 0.5% deoxycholic acid, 25mM 
B-glycerophosphate, 1mM sodium orthovanadate, 2mM sodium pyropho- 
sphate, 2mM EDTA, 1 mM sodium fluoride, 0.5 mM dithiothreitol, EDTA-free 
protease inhibitor cocktail (Roche) and incubated for 30 min at 4 °C. The super- 
natants were collected after a 30 min microfugal centrifugation at 14,000gat 4 °C. 
After precleaning with protein A-Sepharose (Sigma), samples were incubated 
with 5g per sample of mouse monoclonal anti-Green Fluorescent Protein 
antibody (MBL) or mouse non-immune IgG (Jackson Laboratories) at 4°C 
overnight. The immunocomplexes were precipitated with protein 
G-Sepharose beads (Sigma) for 1h at 4°C and analysed by western blotting 
(Supplementary Fig. 9). 

Fluctuation correlation spectroscopy. Fluctuation correlation analysis was per- 
formed under a two-photon microscope with a Ti:Sapphire laser tuned at 
900nm at 25-27°C*. Fluorescence signal was detected with a PMT 
(Hamamtsu), and acquired with a PCI board (Becker-Hickl). HEK-293 cells 
were transfected by Lipofectamine 2000, allowed to express for 2 days, lysed in 
a solution containing 40 mM HEPES, pH 8.0, 0.1 mM EGTA, 5mM magnesium 
acetate, 0.01% Tween-20, and diluted 10-100 times in a solution containing 
10mM Na-HEPES, pH 7.4, 150 mM NaCl and 0.1 mM EGTA. 
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Design and engineering of an O> transport 
protein 


Ronald L. Koder’*¥, J. L. Ross Anderson'*, Lee A. Solomon’, Konda S. Reddy’, Christopher C. Moser’ 
& P. Leslie Dutton! 


The principles of natural protein engineering are obscured by overlapping functions and complexity accumulated through 
natural selection and evolution. Completely artificial proteins offer a clean slate on which to define and test these protein 
engineering principles, while recreating and extending natural functions. Here we introduce this method with the design of an 
oxygen transport protein, akin to human neuroglobin. Beginning with a simple and unnatural helix-forming sequence with 
just three different amino acids, we assembled a four-helix bundle, positioned histidines to bis-histidine ligate haems, and 
exploited helical rotation and glutamate burial on haem binding to introduce distal histidine strain and facilitate O2 binding. 
For stable oxygen binding without haem oxidation, water is excluded by simple packing of the protein interior and loops that 
reduce helical-interface mobility. O2 affinities and exchange timescales match natural globins with distal histidines, with the 


remarkable exception that O2 binds tighter than CO. 


It has long been recognized that natural selection and evolution build 
complexity into natural proteins and biological systems'*. This com- 
plexity frustrates biochemists seeking to understand structure and 
function‘ and presents an extraordinary challenge to protein engineers 
who aim to reproduce or create new functions in proteins. So far, this 
complexity has severely constrained the ability of protein engineers to 
approach the efficiency of natural protein catalysts>"°. However com- 
mon it may be in nature, we maintain that complexity is not an 
essential feature of protein as a material, nor is it an essential feature 
of catalysis, as shown by synthetic chemical systems'!. By understand- 
ing the origins of complexity and making purposeful efforts to sepa- 
rate multiple utilities and minimize complexity during the design and 
testing of artificial proteins that are completely independent of natural 
selection, we show how to progressively build in sophisticated bio- 
chemical features that reproduce and exceed natural protein function. 
Our approach follows that long used by artists and architects who 
develop maquettes-simple models that are progressively altered to 
test and determine the ultimate characteristics of their constructions. 

Protein function requires more than a static structure’. In natural 
proteins, the motion that is part of engineering of protein function is 
often specific, which can make re-engineering motion for new func- 
tions prohibitively difficult. In contrast, artificial proteins offer a full 
palette of motions that engineering can edit to facilitate those that are 
productive and remove those that are unproductive. 

The engineering-based design of functional synthetic proteins pro- 
gresses through four stages (Fig. 1): (1) assembly of a simple, robust 
generic protein framework, such as a helical bundle, of appropriate 
size to sustain eventual cofactor binding and catalytic function; (2) 
insertion of cofactor-binding amino acids, keeping the number of 
amino acid changes low to control complexity; (3) adjusting the 
sequence for improved structural resolution; and (4) iteratively test- 
ing, redesigning and adding engineering elements to refine function. 


Assembly of scaffold and cofactors 


Figure 1 describes our starting maquette comprising polar glutamate 
(E) and lysine (K), as well as non-polar leucine (L). These have high 


a-helix-forming propensities’? which, when arranged in a near- 
repeating heptad sequence LEELLKKLEELLKL, spontaneously 
assemble into a water-soluble four-«-helical bundle with glutamate 
and lysine exposed and leucine buried in a molten globular interior". 
To make a bundle of a length typical of natural proteins, we used 
almost four heptad near-repeats (1 in Fig. 1). Such a bundle is free to 
associate with helices oriented parallel or anti-parallel. Adding an 
amino-terminal CGGG sequence for disulphide-mediated dimeriza- 
tion restricts helical topologies to syn or anti. Completing this stage, 
we replaced an internal leucine at e-position 7 of each helix with 
tryptophan to facilitate optical detection of the protein. 

In the second stage of design, we found that replacing leucines at 
internal a-positions 10 and 24 with histidine was sufficient to anchor 
up to four haems in the bundle’’. We also replaced a-position leucine 
17 between the haems with phenylalanine, an amino acid commonly 
found near haems in natural proteins. To allow us to discriminate 
between the haem-binding sites, we replaced another interior leucine 
with arginine (2 in Fig. 1). No effort was made to design core cavities 
that accommodate haem; reliance was put on interior histidine—iron 
ligation to position the haem in a malleable hydrophobic interior. 

Structural resolution is helpful in keeping progressive design on track. 
Singular structures in these bundles can be engineered without com- 
putation; introduction of B-branched aliphatic or aromatic residues 
along with a polar bond across helices provided by the histidines (3 in 
Fig. 1) confers tertiary structure to the four-o-helix interior of the apo 
protein as seen by NMR and X-ray crystallography'®””. 

Although various functions can be added to this basic haem-binding 
protein, for oxygen transport we elected to simplify the design by 
lowering the haem capacity from four to two by replacing histidine 
at position 24 with phenylalanine. Inspection of model apo structures 
showed that a substantial rotation of >50° around the helical axes was 
required on haem binding to accommodate the histidine rotamers 
typical of natural bis-histidine haem-binding proteins (Supple- 
mentary Fig. 1)'*. This rotation exports hydrophobic interior residues 
into, and imports polar residues from, the aqueous phase. Modelling 
identified four amino acids for substitution to interfacially compatible 


'The Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA. +Present address: Department of 


Physics, The City College of New York, New York 10031, USA. 
*These authors contributed equally to this work. 
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|. Assemble appropriate sized bundle Qe 


abcdefg 


Generic 3-amino-acid sequence 


ll. Insert cofactor-binding amino acids 


2 CGGG ELWKLHEELLKKFEELLKLHEERLKKL | 
Constrain topology with loops; add W as spectroscopic tag; 
add H to bind haem; break haem symmetry with R 


Ill. Improve structural resolution 


3 


SSi 


Remove excess haem sites; improve holo helical register 


5 CGGG EIWKQHEDALOKFEDALNOQFEE-LKOL 


Diversify surface residues to assist NMR analysis 


IV. Iteratively add and refine functions 


Link loops, constrain dynamics/water access 


‘SS! 


Figure 1| The design of an artificial oxygen transport protein (6). Design 
begins with an extremely simple heptad near-repeat sequence using just 
three amino acids (1). Blue highlight marks heptad divisions. Design 
progresses through stages of amino acid changes (red) and trial 
intermediates (2—5) that are tested to reveal functional properties and to 
clarify the roles of individual amino acids. In 2—5, helical sequences shown 
are linked by cysteine disulphide loops and self-assemble in four-helix 
bundles, whereas in 6 a longer loop (red) unites two identical sequences, 
with the loops themselves now disulphide-linked, as shown to the right. 


alanines or glutamines’? and one for deletion to more favourably 
realign the binary pattern after haem addition, yielding 4 (Fig. 1). 
However, three inwardly rotating b-position glutamates at positions 
11, 18 and 25 were deliberately left in place'* to apply strain to weaken 
one of the two histidine—haem iron ligation interactions, as occurs in 
neuroglobin. This creates an entatic state*’. Fourier transform infrared 
spectroscopy (FTIR) shows that these strain-inducing glutamates have 
pK values (that is, pH values at which the residues are half protonated) 
increased by more than two units (Supplementary Fig. 2) and, as in 
natural haem proteins, are strongly electrostatically coupled to haem 
oxidation—reduction, changing the pK by more than three units”! 
NMR showed that addition of two haem Bs transformed unstructured 
apo-4 into a well-defined tertiary structure” (Supplementary Fig. 3). It 
appears that the histidine—iron polar bond can also provide a 
nucleus for interior packing that promotes a singular structure around 
different porphyrin cofactors”. To ease NMR structure determination 
and assignment of 90% of the peptide backbone (Supplementary Fig. 
4), external residues were also diversified at this time, yielding 5 (Fig. 1). 


Exclusion of water for oxy-ferrous haem stability 


None of the molecules discussed to this point, indeed no artificial haem 
proteins so far examined**”*, bind O, stably at room temperature 
(23 °C). This is not because the oxy-ferrous haem cannot be formed. 
Structures 2, 3, 4 and 5 all include the glutamate-based helical strain, 
which weakens the histidine ligation and allows competition from 
other ligands. By first exposing these haem proteins to CO and then 
illuminating at a low temperature to initiate ligand exchange for Oo, as 
was done previously with cytochrome oxidase”, each forms an 
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oxy-ferrous haem at near 100% yield at — 15 °C. The probable reason 
for room temperature instability is the access of water and protons to 
the haem-binding site, which facilitates electron transfer from ferrous 
haem to oxygen. This is clear from mutant studies of globins and the 
relative stability of the oxy-ferrous haem in chemical systems in water- 
free solvents''?””8. 

To engineer water exclusion we constrained the culpable protein 
motion. The crystal and NMR structures of 3 reveal that one of the two 
interhelical interfaces, that which lies between the helices not con- 
strained by haem binding, has an unusually low degree of surface 
complementarity and a high degree of inter-helix motion. Indeed, 
sequence 2 can be modified to introduce large-scale syn—anti flip- 
ping”. To constrain motion, the loops were reconfigured to link the 
helices across the most mobile interface. This also allowed the loops to 
be linked into a monomeric ‘candelabra’ structure, further constrain- 
ing motion (6 in Fig. 1). NMR confirmed the water-restricting effects 
of loop reconfiguration and monomerization. Hydrogen isotope (H/ 
D) analysis of hydrogen exchange protection factors of 4 and 6 at pH 7 
and 25 °C (Supplementary Fig. 5) shows that water-mediated proton 
exchange is complete within 15 min for 4 and is much slower for 6 with 
several core residue backbone amides, including some close to the 
haem, exchanging on a timescale of several hours. 

Maquettes 2—6 display ferric and ferrous haem visible spectra 
indicative of six-coordinate bis-histidine ligated haem B, characteristic 
of cytochrome b, deoxy-neuroglobin and cytoglobin, and quite distinct 
from the five-coordinate myoglobin and haemoglobin. Because the 
dissociation constant (Ky) for binding of the first haem (<1 nM) is 
much tighter than that for the second haem (50nM)*, we simplified 
spectral analysis by binding one haem per bundle (Fig. 2a). NMR 
assignments unambiguously identified the first haem B to bind at H7 
positions at the open end of the candelabra structure of 6. Only ferrous 
haem 6 shows rapid and complete conversion of the ferrous haem into 
the oxy-ferrous haem with a half time of ~50 ms measured by stopped- 
flow spectroscopy. This oxy-ferrous spectrum is remarkably similar to 
that of native neuroglobin (Supplementary Fig. 6). The oxy-ferrous 
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Figure 2 | Haem maquette spectra. a, b, At —15 °C, these spectra are stable 
for more than an hour: oxidized (green), reduced (blue), carboxy-ferrous 
(black) or oxy-ferrous (red) artificial oxygen transport protein 6 with either 
haem B (a) or haem A (b) as the cofactor. c, d, Stopped-flow spectral changes 
for mixing the reduced haem B proteins with oxygen at 15 °C. The fully 
designed oxygen transport protein 6 (c) shows the transformation of the 
reduced haem (blue) to the oxy-ferrous state (red), which eventually 
becomes oxidized (green), whereas the early intermediate 2 (d) proceeds 
directly and rapidly to the oxidized form. 
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Figure 3 | Modelling kinetics of haem ligand binding and release. a, Haem 
binding rotates helices but incurs strain by burying glutamates (red). Blue 
and pink indicate, respectively, the N- and C-terminal ends of the helices. 
Like some natural haemoglobins, at least two conformations are present; the 
closed form cannot bind CO. Histidine (His) release with helical rotation 
precedes gaseous ligand binding. b, Immediate CO photolysis difference 
spectra (4 [1s minus 0 |1s, red) of 6 haem B is followed by a rapid microsecond 
relaxation (yellow) followed by slower histidine and CO binding (green) and 
even slower displacement of histidine by CO (blue). Subsequent biphasic 
recombination kinetics (characteristic times t, and t,) at 418 nm (c¢) and the 
product (top) and sum (bottom) of these characteristic times as a function of 
CO concentration (d) determine histidine on/off rates. 


state is stable for tens of seconds before single electron transfer from 
ferrous haem to O, appears to generate superoxide. 

A similar experiment with CO shows rapid displacement of one 
histidine to generate an indefinitely stable carboxy-ferrous haem 
species in a 400-ms half time (not shown). After CO photolysis, 
CO rebinding to 6 is multiphasic (Fig. 3 and Supplementary Fig. 7), 
as observed in the hexacoordinate globins; detailed analysis of these 
data allows the estimation of the histidine on- and off-rates (Table 1). 
CO rapid mixing shows slower binding than expected from photo- 
lysis, indicating the presence of at least two conformations that differ 
in CO accessibility, termed ‘open’ and ‘closed’ in analysis of hexacoor- 
dinate haemoglobins”. Saturation of the observed CO binding rate 
allows estimates of the opening and closing rates (Supplementary Fig. 
8). Competition between CO recombination and oxidation by added 
ferricyanide also allows an estimation of the rate of thermal CO 
dissociation. Table 1 presents the kinetic constants (also shown in 
Fig. 3a) and equilibrium dissociation constants that have been deter- 
mined for 6 and contrasts them with several natural oxygen-binding 
proteins. Whereas the oxygen off-rate is similar to that of human 
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neuroglobin’', the on-rate is almost 100 times slower and resembles 
that of Ascaris haemoglobin. These artificial proteins are robust and 
adaptable enough to redesign for the testing of the hypothesis that a 
large hydrophobic pocket, as in neuroglobin”’, can speed O binding, 
whereas proximal strain, as in Ascaris haemoglobin’, slows O, binding. 


Testing engineering elements 


To test whether the helical rotation model of histidine strain was indeed 
operating to promote CO and O, binding, we changed the b-position 
glutamates of 6 to alanines. As anticipated, CO binding slowed by more 
than an order of magnitude (Supplementary Fig. 9). Moreover, O2 
failed to form detectable oxy-ferrous haem. Instead, the haem under- 
went oxidation within the mixing time, plausibly by means of an outer- 
sphere electron transfer. Thus, removal of the modelled entatic state” 
by conversion of interfacial glutamates to alanines disables the histidine 
for Oz or CO exchange-gated rotational mechanism. 

In exploring the effectiveness of loops to control mobility of free 
helices and thereby exclude water from the interior, we found that the 
oxy-ferrous haem is not stabilized by looping together helices already 
linked by haem (5), but is stabilized by looping together helices not 
linked by haem (6). Disulphide linkage of the loops in 6 proves 
unessential, because eliminating the disulphide through a Cys-to- 
Ser substitution maintains oxy-ferrous haem stability. 

Our development of helical strain and motion constraint to promote 
O, binding to ferrous haem is not confined to one site, but extends 
along the four-helix bundle to haems at other positions. A di-haem 
variant of 6 with histidine at position 7 and the complementary position 
42, or a single-haem variant with histidine only at position 42, each 
display oxy-ferrous haem properties similar to that observed in 6 
(Supplementary Fig. 10). 


Haem and substrate specificity 


The interior of 6 adopts a unique structure not only around haem B 
but also around other porphyrins including haem A (Supplementary 
Figs 11 and 12). Haem A has markedly different peripheral substitu- 
tions (Fig. 2b), which result in different redox and spectral properties. 
The oxidation-reduction midpoint potential at pH 8 (£,,3) measured 
versus the normal hydrogen electrode (NHE) of bound haem A is 
—100 mV (data not shown). This is about 200 mV more positive than 
haem Band hence less favoured to reduce O3 toO,”. Accordingly, the 
oxy-ferrous haem A (spectrally analogous to the cytochrome oxidase 
Compound A described previously*®) persists longer, about half a 
minute, before electron transfer. 

In all natural haemoglobins with distal histidines, either preferen- 
tially bound to the haem iron, as in neuroglobin, or displaced from 
the iron, as in myoglobin or human haemoglobin, CO is a poison that 
binds more tightly than O2. Our work shows this is not an essential 
property. The binding of CO to 6 is weaker than to natural globins 
(Table 1). This results in a net tenfold discrimination favouring Oz 
binding over CO binding, the largest observed for any distal histidine 
haem—protein complex, and comparable to distal tyrosine sites 
with extreme O, affinity. It seems probable that the strained, distal 


Table 1| Haem iron ligand on- and off-rates and equilibrium constants in natural and artificial proteins 


Haem protein Ligation kuis on (S*) Kitis ot keo on (HM-*S~*) keo ot (S*)  Kaco (nM) —ko2 on (HM *8°*) Kozo (S*) Kao2(nM) ~~ Kao2/Kaco 
(s-*) 

6 (haem B) bis-His 310 17 0.95 (0.006) 0.034 36 (6,000) 0.31 (0.017) 0.1 32 (600) (0.1) 

Neuroglobin*+ bis-His 2,000 45 65 (0.15) 0.014 0.21 (93) 250 (0.56) 0.8 3.2 (1,400) (15) 

Myoglobin*” His-aquo n/a n/a 0.5 0.019 37 14 12 860 23 
(distal His) 

Human haemoglobin*® His-aquo n/a n/a 2.2 0.009 4 19 15 770 190 
(distal His) 

Ascaris haemoglobin’? His-aquo n/a n/a 0.2 0.018 90 15 0.004 3 0.03 
(distal Tyr) 

Microperoxidase°? His-aquo n/a n/a 20 0.01 0.5 

Combinatorial bundles** = Mixed ? ? (3-11) 0.03-0.11 (6-25) 

Apparent binding rates (in parentheses) are slowed by bis-histidine (bis-His) ligation. Corresponding rates not in parentheses are estimated binding rates without histidine impedance facilitating 

comparison with proteins with different fractions of bis-histidine ligation. His-aquo has water as the sixth iron ligand. n/a, not applicable; ?, unknown. 
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histidine in 6 remains available to stabilize bound oxygen, as is seen in 
haemoglobins, myoglobins and horseradish peroxidase**. 


Protein engineering to control complexity and utility 


Management of complexity and multiple-utility is critical to engineer- 
ing protein design. Natural proteins accumulate complexity in a 
manner analogous to that described in genetic systems as ‘Muller’s 
ratchet’: a change is made, typically for minor or even no selective 
advantage, that becomes essential as new changes come to depend on 
this old change. The resulting interdependency between amino acids is 
the source of protein complexity. Not only does one amino acid 
commonly affect the function of nearby or allosterically remote amino 
acids, any amino acid typically supports multiple functions or utilities, 
in a way analogous to the organism scale description of Darwin'”>”*. 

Amino acid interdependency and multiple-utility of amino acids 
presents a serious challenge to re-engineering a natural protein for a 
new purpose. When more than a few amino acids are ‘rationally’ intro- 
duced to natural proteins by site-directed mutagenesis, protein stability 
and activity are often deeply compromised*’. Limited adaptability of 
natural proteins to accommodate new catalytic site functionalities has 
been the experience so far with a variety of powerful approaches includ- 
ing abzymes*’, computational design combined with directed evolu- 
tion® and recombination through domain swapping”. 

Maquette design that begins with simple protein sequences never 
exposed to natural selection and tests the effects of amino acid changes 
through the progressive stages of design does not eliminate complexity 
or multiple utility, but allows them to be more clearly seen and 
moderated. This approach offers reconciliation of each amino acid 
with specific roles in structure and function, the choice to eliminate 
redundancies, and the freedom to organize assembly and mechanistic 
elements while incorporating the next stage of development. As a 
result, it takes relatively few amino acid changes to incorporate the 
function of oxygen transport into a generic three-amino-acid bundle. 
The fact that such a modest redesign succeeds supports the view that 
many protein-engineering elements do not require atomistic precision 
and that exacting mimics of natural protein sites is neither necessary 
nor good engineering. Indeed, despite the widespread fragility of 
enzymes to multiple active site mutations, it appears that relatively 
simple and unsophisticated engineering principles underlie natural 
protein functions such as electron transfer*® and even catalysis*”’. 

One of these key engineering principles is the creation and control 
of protein motion. The long-range helical motion and strain that 
facilitates ligand exchange for oxygen transport, while still permitting 
relatively secure haem binding by means of bis-histidine ligation, 
would not have been possible merely by targeting the static structure 
of the final oxy-ferrous state in a natural haem protein. Any static 
structure-focused design®’** fails to address the molecular choreo- 
graphy often critical for function. In contrast, liberal motion is equally 
unworkable. Our balance of allowing inter-helical protein motion but 
constraining it through bonded and loop contacts allows small ligands 
to penetrate and bind while excluding the water that so profoundly 
affects active site properties. In different maquette variants, limiting 
water access slowed the rate for haem-oxygen electron transfer by four 
orders of magnitude, suggesting the raising of a catalytic barrier by 
>5 kcal. 

By keeping the maquette design simple and complexity low, we 
maintain the capacity to intentionally insert more complex functional 
elements such as allostery and cooperativity. The helical rotation built 
into this system introduces a long-range negative cooperativity in the 
two haem-binding sites. Versions of 6 that modify the histidines at one 
or the other binding site clearly show that on their own both haem- 
binding sites have high affinity, whereas, together, rotation on binding 
the first haem weakens the second. Such long-range helical coupling is 
a prospective means to build a maquette with positive cooperativity 
between the haem sites for oxygen binding, recreating the vital oxygen 
transport regulation for which haemoglobin is so well known. 
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The creation of the first completely artificial oxygen transport 
protein allows us to reconsider the design of natural oxygen transport 
globins. Despite the common view that natural globins are full of 
exquisitely refined functional properties reflected in the globin fold’, 
it is clear that transport function is readily achieved without such a 
fold. Indeed, the results presented here reinforce the developing view 
that the physical chemistry of haem in an oxygen transport protein 
can accommodate a wide range of histidine, O, and CO ligand 
exchange rates including, as we have seen here, the preference for 
O, over CO binding. The discrimination favouring O, over CO 
binding comes about without added engineering. Taking into 
account the impedance from the 95% bis-histidine ligation in 6 
expected from the histidine on- and off-rate ratio, binding of CO 
and O, ligands are both relatively slow compared to most natural 
globins, but the thermal release rates are towards the fast and slow 
ends of the natural range, respectively (Table 1)**. Traditionally, slow 
O, off-rates are explained in terms of nearby polar residues providing 
a hydrogen bond to the O,; simple modelling of 6 shows the distal 
histidine could play this part here. However, no special design was 
required beyond giving this histidine sufficient mobility by means of 
the helical strain arising from glutamate burial. The ease with which 
globin-like properties can be reproduced in a completely unrelated 
and simply engineered maquette indicates that the relatively complex 
globin fold is for the most part unremarkable, and may be common 
in nature not because of a uniquely capable design for oxygen bind- 
ing, but simply because it is good enough. 


METHODS SUMMARY 


Protein expression, purification and confirmation of purity were performed as 
described previously’. CO rebinding after photolysis by a frequency-doubled YAG 
laser at 532 nm, ~1-ns pulse width (Fig. 3c), is multi-exponential, a characteristic 
signature of human neuroglobins”’. Detailed analysis of time-resolved spectra in 
the flash experiments is consistent with a rapid exponential phase due to a change 
in pentacoordinate haem conformation (see Fig. 3b), similar to that reported 
for myoglobin**. After this relaxation, slower biphasic kinetics are associated with 
direct CO and histidine binding, followed by final histidine—CO exchange. Using 
the kinetic model of Hargrove’ to analyse the CO-concentration-dependence of 
the CO binding rates on flash photolysis allows the estimation of histidine on- and- 
off rates and per cent bis-histidine ligation for ferrous 6 (Fig. 3d). 

Predominantly mono-exponential binding kinetics of O, and CO with the 
ferrous haem B in 6 were followed spectroscopically in rapid stopped-flow mixing 
over gas concentrations from 2% to 50% saturation at 15 °C using an Olis RSM 
1000 spectrometer that scans both the alpha band and Soret band absorptions 
1,000 times a second followed by SVD analysis of the kinetic components. The 
dissociation rate of the CO complex was determined by competition with ferri- 
cyanide using a range of ferricyanide concentrations according to the ferric haem 
trapping method described previously“*. 

Preparations of ferrous haem synthetic proteins in CO saturated 30% ethylene 
glycol borate buffer at pH 9.0 are stable to ligand exchange on a timescale of 
several minutes in the dark at — 15 °C. This allows spectra such as those shown in 
Fig. 2 to be prepared by re-equilibration with O, or O,/CO gas mixtures followed 
by photolysis by halogen lamp illumination for several seconds to initiate rapid 
O, binding, as described previously”®. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Typical CO photolysis conditions were 101M synthetic protein in 20mM 
borate, pH 9.0, 100mM KCI and CO/Ar gas mixtures ranging from 4% to 
100% CO gas saturation with a slight excess of sodium dithionite monitored 
in the ultraviolet to ensure that the haem was reduced. 

Table 1 reports the direct, apparent CO and O; binding rates as well as estimates 
of the faster rates when not bis-histidine ligated. Apparent CO binding rates from 
photolysis are ~20 times faster than that reported by stopped flow, indicating that 
shortly after photolysis bis-histidine ferrous 6 may be in a relatively unrelaxed 
‘open’ state, prone to histidine dissociation and CO rebinding, compared to the 
dark, equilibrium, resting ‘closed’ state’®. 

Stable spectra of the oxy-ferrous form are also obtained by direct bubbling of 
ferrous haem protein with oxygen at — 15 °C without using CO as an intermediate. 
Competition between CO and O, ina range of mixtures of the two gases reaches 
equilibrium over 30 min after initial photolysis at —15 °C and indicates that O2 
binds with 10 times greater affinity than CO (for example, a 91:9 ratio of CO:0, 
gives 50:50 mix of bound CO and O;). 

Materials. Peptide synthesis reagents were purchased from Perseptive 
Biosystems. Haemin was from Fluka. The chloride salt of Fe* *-protoporphyrin 
III was synthesized by methods described previously*!. Deuterium oxide, '°N- 
ammonium chloride, '*C-glucose and (trimethylsilyl)-propionate were from 
Cambridge Isotope Laboratories. All other solvents and reagents were from 
either Fisher Scientific or Sigma. 

Peptide synthesis and purification. Solid-phase syntheses of 1—5 were performed 
on a continuous flow PerSeptive Biosystems as described previously. Crude 
peptides were purified to homogeneity by reversed phase Cjg HPLC using 
aqueous-acetonitrile gradients containing 0.1% (vol/vol) trifluoroacetic acid. 
Peptide homogeneity and composition were assayed by analytical HPLC and laser 
desorption mass spectrometry. 

The gene encoding 6 was designed using the program Amplify”’. The gene was 
constructed using assembly PCR” and ligated into the overexpression vector pET 
32d(+) (Novagen, Inc.) modified to replace the enterokinase cleavage site with a 
TEV-protease cleavage site, yielding the plasmid pHP7-TEV. The plasmid was 
transformed into DH5« cells and sequenced in the region of interest to verify the 
presence of the intact gene, and the plasmid was transformed into BL21(DE3) cells. 
Mutations to 6 were created using the Quick Change protocol (Stratagene, Inc.). 

For unlabelled expression, these were grown in TPP medium™ at 37 °C to a 
Dgoo of 1.0 and induced with IPTG at a final concentration of 1 mM for 2 h before 
collection. For '°N-labelled expression, cells were grown at 37°C in minimal 


nature 


media containing 1 gl! ‘SN’ ammonium chloride to a Deoo of ~1.0, induced 
with 1 mM IPTG, and shaken at 37 °C for an additional 3 h. For '°C,'°N-double 
labelling, cells were grown in minimal media containing 1% w/v '°C glucose, 
1gl! °N ammonium chloride and 8mll~’ '°C,!°N Bioexpress medium 
(Cambridge Isotope Labs). Cells were grown at 37 °C to a Deoo of ~1.0, induced 
with 1 mM IPTG, and shaken at 37 °C for an additional 2h. 

Cells were collected by centrifugation, broken open using a French press, and 

purified on a Ni-nitrilotriacetic acid column (Qiagen, Inc.) according to the 
manufacturer’s instructions. The fusion protein was dialysed into 50 mM Tris- 
HCl, 1 mM DTT, pH 8.0, and then cleaved overnight (15-18 h) with His,-tagged 
TEV protease (Invitrogen, Inc.). The reaction mixture was filtered through Ni- 
nitrilotriacetic acid resin, and the purified 6 concentrated by lyophilization. 
Purified peptides were dissolved in 20 mM K,HPO, and 100 mM KCl, pH 8.0, 
and cysteine residues were air-oxidized to the symmetric disulphides overnight. 
Disulphide formation was followed by analytical C,g HPLC. 
FTIR. 6 was washed by dilution with 100 mM potassium phosphate and 100 mM 
KCl at pH 6.0 or pH 8.0 and concentrated to ~1 mM with Vivascience centri- 
fugal concentrators. Benzyl viologen (100 1M) was added as a redox mediator 
and the sample was placed in a microelectrochemical chamber above an ATR- 
FTIR microprism (3 bounce silicon; 3 mm diameter; SensIR). The chamber hada 
5-mm-diameter glassy carbon working electrode within 1mm of the prism 
surface and the sample solution was connected to a counter electrode and an 
Ag/AgCl reference electrode via an ion-permeable frit. Potential was switched 
cyclically between 0mV and -450mV (versus SHE). Before each potential 
change, a reference scan was recorded. The potential was then switched and a 
sample spectrum was recorded after 10 min to allow for redox equilibration. 
Data at pH 6.0 average 50 reductive minus 50 oxidative cycles of 500 interfero- 
grams each at 4 cm’ resolution; data at pH 8 average 40 reduced minus oxidized 
cycles; the double difference spectrum pH 6 minus 0.58 pH 8 reflected appro- 
priate protein concentration-dependent weighting at the two pH values. 
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Femtosecond characterization of vibrational optical 


activity of chiral molecules 


Hanju Rhee’”, Young-Gun June’, Jang-Soo Lee’”, Kyung-Koo Lee'”, Jeong-Hyon Ha’, Zee Hwan Kim’, 


Seung-Joon Jeon’” & Minhaeng Cho’? 


Optical activity’? is the result of chiral molecules interacting 
differently with left versus right circularly polarized light. 
Because of this intrinsic link to molecular structure, the deter- 
mination of optical activity through circular dichroism (CD) spec- 
troscopy has long served as a routine method for obtaining 
structural information about chemical and biological systems in 
condensed phases**. A recent development is time-resolved CD 
spectroscopy, which can in principle map the structural changes 
associated with biomolecular function’ and thus lead to mechan- 
istic insights into fundamental biological processes. But implem- 
enting time-resolved CD measurements is experimentally 
challenging because CD is a notoriously weak effect (a factor of 
10-*-10~° smaller than absorption). In fact, this problem has so 
far prevented time-resolved vibrational CD experiments. Here we 
show that vibrational CD spectroscopy with femtosecond time 
resolution can be realized when using heterodyned spectral inter- 
ferometry to detect*° the phase and amplitude of the infrared 
optical activity free-induction-decay field in time (much like in a 
pulsed NMR experiment). We show that we can detect extremely 
weak signals in the presence of large achiral background contribu- 
tions, by simultaneously measuring with a femtosecond laser pulse 
the vibrational CD and optical rotatory dispersion spectra of 
dissolved chiral limonene molecules. We have so far only targeted 
molecules in equilibrium, but it would be straightforward to 
extend the method for the observation of ultrafast structural 
changes such as those occurring during protein folding or asym- 
metric chemical reactions. That is, we should now be in a position 
to produce ‘molecular motion pictures’ of fundamental molecu- 
lar processes from a chiral perspective. 

CD spectroscopy is considered an incisive tool for determining the 
secondary structure of proteins in solution*®!*'’. But biomolecules 
participating in chemical or biological reactions often undergo ultra- 
fast structural changes that modulate their chiro-optical properties. 
The desire to map the chirality changes of such reactive systems over 
time, as a means to gain insight into the underlying structural changes, 
thus naturally led to the development of transient electronic CD 
measurement techniques”’*'®. One approach to improve the rela- 
tively poor time resolution of electronic CD spectrometry and over- 
come the weak-signal problems uses an ellipsometric detection 
scheme”"™, but its time resolution limit is set by the (limited) speed 
of the measurement electronics, such as amplifying gates or transient 
digitizers. Another approach combines an ultrashort laser with an 
electro-optic modulator or a Babinet-Soleil compensator to measure 
electronic CD or optical rotatory dispersion (ORD)'*”® signals with 
subpicosecond time resolution. These approaches should in principle 
enable one to monitor time-resolved CD spectra by first optically 
triggering the chemical or biochemical reaction, and then measuring 


the differential absorption AA of time-delayed light pulses that are 
left- and right-circularly polarized (LCP and RCP) or left- and right- 
elliptically polarized (LEP and REP) (that is, AA=Arcp — Arcp or 
AA= Arp — Apggp). But the mode-locked ultrafast lasers used in such 
pump-probe measurements usually lack the intensity stabilities of 
approximately 0.001% that are needed to discriminate minuscule 
CD signals from the large achiral background signal. 

Vibrational circular dichroism (VCD) is the vibrational analogue 
of the more popular and extensively used electronic CD. It is useful 
for differentiating between discrete structural conformations of bio- 
molecules*’*'*"” and for determining absolute configurations of 
chiral molecules and drugs. Time-resolved VCD spectrometry could 
thus play a key role in unravelling the mechanisms of important 
biological or chemical processes, by monitoring the structural evolu- 
tion of biomolecules or chiral molecules during reaction. But time- 
resolved VCD experiments are considered even more challenging 
than electronic CD experiments, mainly because the chiral suscep- 
tibilities for nuclear vibrations are much smaller than those for elec- 
tronic degrees of freedom; consequently, neither an experimental set- 
up nor actual measurements involving ultrafast time-resolved VCD 
have been reported to date. Our approach to realizing femtosecond 
vibrational optical activity (CD and ORD) measurements involves 
detecting the phase and amplitude of the coherently emitted infrared 
optical activity free-induction-decay (FID) field (Fig. 1) using a novel 
cross-polarization detection scheme (Fig. 1a). 

Figure 1a sketches the key principles of our experiment. Molecules 
with opposite chiral properties absorb LCP and RCP light differently. 
Therefore, an excitation pulse of linearly polarized light—an equal 
superposition of LCP and RCP—is transformed into LEP or REP light 
on interacting with chiral molecules. The interaction with the chiral 
molecules will at the same time give rise to circular birefringence, so 
that RCP light is transmitted with a different velocity from LCP light: 
the molecules rotate the major axis of the polarization ellipse arising 
from the CD phenomenon by an amount given by their circular 
birefringence An, defined as the differential index of refraction of 
LCP and RCP beams (4n= nycp — ngcp). Particularly relevant here 
is that in the transmitted elliptically polarized light, information on 
the optical activity and hence chiral properties of the molecules is 
almost exclusively contained in the minor-axis field component 
(horizontal electric field component in Fig. 1a, E, ); in contrast, the 
major-axis field component (vertical electric field component, E)) is 
mostly influenced by all-electric-dipole-allowed achiral features of the 
molecule. As a result, a polarizer oriented perpendicular to the polar- 
ization direction of the excitation beam will select only the chiral (CD 
and ORD) components of the transmitted elliptically polarized light. 
It is the phase of the minor-axis field (E, ) that provides information 
on the handedness of the chiral molecule’*. We found that the 
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Figure 1 | Heterodyned detection of optical activity free induction decay. 
a, Sketch of the basic principles of the cross-polarization detection in our 
optical activity FID measurements. We first linearly polarize a coherent 
femtosecond light pulse with linear polarizer LP1. The pulse is then incident 
on the chiral sample, and the left- and right-circularly polarized components 
(LCP, RCP) of the linearly polarized field are differentially absorbed. 
Depending on the chirality of the sample, this differential absorption (circular 
dichroism, CD) transforms the pulse into one with an eccentric left- or right- 
elliptical polarization (LEP, REP). Furthermore, the circular birefringence of 
the chiral molecules causes an optical rotation of the major (vertical) axis of 
the field polarization ellipse created through CD. After passage through the 
sample, the light pulse passes through linear polarizer LP2. LP2 is oriented 
perpendicular to LP1, so most of the transmitted field parallel to the incident 
beam is rejected while the perpendicular field components are retained. These 
perpendicular components are out of phase with each other after interaction 
of the light with two different enantiomers, the (R)-form (blue) and (S)-form 
(red); the individual CD (solid lines) and ORD (dashed lines) components are 
approximately in quadrature if the optical rotation angle is small. b, Sketch of 
the set-up for Fourier transform spectral interferometry experiments. The 
input pulse is split in two by a beam splitter (BS1). One beam is linearly 
y-polarized by LP1 and used to create electronic or vibrational coherences in 
the chiral sample (C), while the other is used to amplify and characterize the 
emitted FID field. For measurements of both parallel and perpendicular 
components of the FID fields (EPP ) emitted from C, the transmission axes of 
the linear polarizers LPO and LP2 are aligned along the y- and x-axis, 
respectively. The reference pulse (ER )and the signal field are combined by 
another beam splitter (BS2), with the former preceding the latter by a finite 
time, Ty. The heterodyned signal field is dispersed by a monochromator (MC) 
and detected by detector D. 


relationship between E, (w) and Ej(«), where w is frequency, is given 
by theory as (see Supplementary Information for details) 


E, (w)cAy(o)k\(o) (1) 


where E)(c) is the complex transmitted electric field spectrum that 
results from the interference between the input field and the induced 
electric-dipole-allowed optical FID field. The complex function Ay(@) 
is the optical activity susceptibility; its imaginary part directly corre- 
sponds to the CD spectrum Aa(), and its real part to the frequency- 
dependent ORD spectrum. Although the minor-axis component 
E| (@) is extremely weak, the optical heterodyning allows us to amplify 
the weak signal field to give readily measurable levels of phase and 
amplitude information. Once the minor- and major-axis field compo- 
nents, E) (@) and B, (cw), are measured, from equation (1) the complex 
susceptibility Ay(@) can be directly obtained. 

Our femtosecond measurement technique of vibrational optical 
activity utilizes heterodyned Fourier transform _ spectral 
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interferometry*'°'"** as sketched in Fig. 1b (see Supplementary 


Information for more details regarding the experimental set-up). 
Briefly, we use an infrared pulse (1 kHz repetition rate, 2 uJ, centre 
wavelength of ~3.4um, and pulse width of ~60fs) generated by 
difference frequency mixing of a signal and an idler pulse from an 
optical parametric amplifier. To measure E , («), the linear polarizers 
LPO and LP2 are directed along the x axis while the linear polarizer LP1 
is oriented along the y axis. In this cross-polarization configuration, 
only the E, (@) component of the elliptically polarized light pulse that 
is transmitted through the sample is allowed to pass through LP2 for 
heterodyne detection. (A high-resolution motorized rotational stage 
is used to finely control the relative orientation of LP1 with incre- 
mental resolution of 0.0005°.) This FID signal and the reference pulse 
are then combined and spectrally dispersed with a monochromator, 
and the cross-polarization spectral interferogram S| (a) is detected. 
To measure the parallel-polarization spectral interferogram S)(q) that 
originates from £\(«), we rotate LP1 by a mere 0.5°; this alters the 
beam pathway and experimental set-up only minimally, and the phase 
change induced by the rotated linear polarizer is thus negligibly small. 
We find that the performances of the polarizers (LP1 and LP2) are 
critical for the success of the experiment because E| («) is typically 
several orders of magnitude smaller than E, (@) when LP1 || LP2. We 
therefore chose calcite plate polarizers” for the main polarizer pair 
that removes the large achiral signal contribution E\(): unlike usual 
prism-type polarizers (with extinction ratios of ~10~°), calcite plate 
polarizers utilize dichroic absorption that results in exceptionally 
small extinction ratios (10 °-10 °) within limited infrared spectral 
regions (3.35—3.5 tum, 3.9—4.1 um, and so on). 

To test both the cross-polarization detection scheme as a means for 
realizing femtosecond vibrational optical activity spectrometry 
(Fig. la) and our experimental set-up (Fig. 1b), we performed 
proof-of-principle measurements on the (R)- and (S)-enantiomers 
of limonene and their racemic mixture. Even though the working 
ranges of our calcite dichroic polarizers are limited, most of the C-H 
stretching mode frequencies of limonene fall within one of their 
spectral windows. Figure 2a shows the dispersed spectral interfero- 
grams (that is, S| (w) and S)(@)) measured in the 2,840 to 3,000 cm — * 
wavenumber range; all three spectral interferograms in Fig. 2a exhibit 
systematic differences in terms of oscillating amplitudes and phases. 
We note here that the heterodyne signal measured for the racemic 
mixture is finite (though relatively small) rather than zero; this signal 
is due to an optical imperfection of the linear polarizers, and origi- 
nates from interference between the reference pulse and a weak hori- 
zontal light pulse component leaked from LP1. Although not ideal, 
this achiral measured signal component contributes only to the VCD 
spectrum as a background offset (Supplementary Fig. 3) and can thus 
be removed. 

The CD and ORD spectra are obtained by transforming the 
measured spectral interferograms S,(q@) and S\(@) into the electric 
field spectra E,(@) and £E\(m) and then taking the ratio 
E\(@) / E\(@); the imaginary and real parts of this ratio then yield 
the CD and ORD spectra, respectively. The spectral characteristics 
(amplitude and phase) of the reference pulse and an additional phase 
term produced by the time delay tg between the signal field and the 
reference pulse (Fig. 1b) contribute equally to S; (@) and S\(@), so 
their contributions automatically cancel out when taking the ratio of 
E\(@) to E\(@). This means that as long as the two paths in the 
interferometer have a constant delay time tg and are sufficiently 
stable (in terms of phase fluctuations) during measurement, CD 
and ORD spectra can be retrieved from the measured interferograms 
without requiring a detailed characterization of the reference pulse or 
precise determination of the value of the delay time tg. This feature is 
crucial for significantly boosting the sensitivity of our method. 

Figure 2b depicts VCD spectra Ax(@) extracted from the spectral 
interferograms measured with samples of (R)- and (S)-limonene and 
their racemic mixture. The VCD peak positions of (R)- and (S)- 
limonene are identical and their signs opposite to each other, and 
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Figure 2 | Vibrational optical activity signals of chiral limonenes. a, Cross- 
polarization spectral interferograms S| (@) of (R)-limonene (blue line) and 
(S)-limonene (red line) and their 1:1 racemic solution (green line) dissolved 
in CCl,. (Analyte concentration, 1.2—1.5 M; path length, ~50 tm; maximum 
absorbance, 1.1 for (R)-limonene and racemic solution, and 1.4 for (S)- 
limonene.) Parallel-polarization spectral interferograms S\(c) are required 
to retrieve VCD and VORD spectra, and the inset therefore gives that of (R)- 
limonene. Coloured dots give the measured data points for comparison, with 
the dashed line indicating zero. Note that the three limonene samples have 
distinctive spectral phases and amplitudes, which are different from one 
another. b, VCD spectra retrieved from the measured spectral 
interferograms in a, with linear baseline correction to enable a clear 
comparison. Absolute VCD values (AA) were directly calculated according 
to (4/2.303)Im[E | ()/E|(@)] (Supplementary Information). Ab initio 
calculations using MP2/6-311++G** identified the three major (R)- 
limonene conformers A, B and C (top-left inset) and gave the simulated C-H 
stretch VCD spectrum shown in the top-right inset. c, VORD spectra 
extracted from the real part of Ay(@). The absolute VORD values (Ag) were 
also calculated by Re[E 1(@) / E \(@)| (Supplementary Information), and the 
maximum optical rotation angle (at 2,927 cm") found to be about 

1.6 X 10 * degrees. 
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the spectra are quantitatively consistent with spectra measured using 
a continuous wave Fourier transform infrared (FT-IR) VCD spectro- 
meter™ (Supplementary Fig. 5). This indicates that our technique is 
robust and reliable. The racemic spectrum does not show any notable 
structure, as expected. A series of concentration-dependent VCD 
measurements confirmed that the attenuation of the signal field by 
self-absorption does not affect the spectrum as long as the experi- 
mental optical density is close to or less than 1. We note that the data 
collection time needed to obtain the VCD spectra in Fig. 2b is only 
tens of minutes, whereas measurements with the commercial FT-IR 
VCD spectrometer required multiple hours of signal averaging. The 
enhanced sensitivity of our method is due to the fact that the 
measurements are non-differential, free from achiral background 
signal, and based on heterodyned detection. 

Finally, Fig. 2c shows the vibrational ORD (VORD) spectra of the 
(R)- and (S)-limonene and their racemic mixture obtained directly 
from the real part of Ay(@). The spectra illustrate successful location 
of a maximum in the angle of optical rotation of an infrared beam by 
(R)-limonene in solution that is as small as about 10 ° rad. We note 
that resonant VORD spectra of a liquid crystal” and off-resonant 
VORD spectra of polypeptides”® have been measured before; but the 
direct measurement of the resonant VORD spectrum ofa small chiral 
molecule in solution presumably posed significant experimental 
challenges due to the minute extent of optical rotation and the large 
attenuation of the signal in the resonant frequency region. To enable 
a comparison between theory and experiment, we carried out high- 
level ab initio calculations of an isolated (R)-limonene molecule and 
quantum mechanical/molecular mechanical simulations of limo- 
nene/CCl, solutions. The calculations identified three major confor- 
mers (Fig. 2b top left inset) and yielded a simulated VCD spectrum 
(Fig. 2b top right inset) in good agreement with experiment; this 
illustrates that the current quantum chemistry calculation method, 
when properly combined with molecular dynamics simulation tech- 
niques, is able to provide a quantitative description of the absolute 
configuration of a given chiral molecule in solution?” ”’. 

Although we have not yet carried out time-resolved spectroscopy, 
the present work clearly establishes that femtosecond optical pulses 
can be used to simultaneously measure VCD and VORD spectra with 
a time resolution that is only limited by the optical activity FID time. 
Time-resolved CD (ORD) experiments using the current technique 
can therefore be implemented by adding a pump pulse to trigger 
chemical or physical changes (such as protein unfolding) and then 
using a femtosecond optical pulse to probe the subsequent relaxation 
process after varying waiting times, Ty. This would result in a series of 
VCD and VORD spectra as a function of T,,, and provide informa- 
tion on the conformational change of the target molecule. In most 
experiments so far, it was found that the pump-induced changes 
result in signals that are too weak to be discriminated from large 
static signals when trying to differentiate measured CD spectra at 
different times; thus, a pump modulation technique that allows 
one to measure only the pump-induced changes would be required 
to increase sensitivity and enable measurements. In contrast to 
previous time-resolved CD experiments relying on differential 
measurements, the present method does not require any polarization 
modulation of the probe pulse and should thus enable the develop- 
ment of a highly sensitive time-resolved CD version in a straightfor- 
ward manner. 

Our phase-and-amplitude measurement technique overcomes 
both the small-signal and non-zero-background problems of con- 
ventional electronic and vibrational CD techniques relying on differ- 
ential absorption measurement, owing to its capability of coherent 
amplification of a weak signal field. Although the present work 
demonstrated infrared CD and ORD measurements, the key con- 
cepts and the technique are sufficiently general that they can be 
readily applied to the measurement of other related chiro-optical 
properties. We therefore anticipate that the present method, which 
is a CD analogue of pulsed NMR, will trigger novel developments to 
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further improve time-resolved optical activity spectroscopy for use in 
studies of biomolecular dynamics in aqueous solution at physio- 
logical conditions. 
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Hybrid organic-inorganic rotaxanes and 


molecular shuttles 
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The tetravalency of carbon and its ability to form covalent bonds 
with itself and other elements enables large organic molecules with 
complex structures, functions and dynamics to be constructed. The 
varied electronic configurations and bonding patterns of inorganic 
elements, on the other hand, can impart diverse electronic, mag- 
netic, catalytic and other useful properties to molecular-level struc- 
tures. Some hybrid organic—inorganic materials that combine 
features of both chemistries have been developed, most notably 
metal-organic frameworks', dense and extended organic— 
inorganic frameworks” and coordination polymers’. Metal ions 
have also been incorporated into molecules that contain inter- 
locked subunits, such as rotaxanes*’ and catenanes®*®, and struc- 
tures in which many inorganic clusters encircle polymer chains 
have been described’. Here we report the synthesis of a series of 
discrete rotaxane molecules in which inorganic and organic struc- 
tural units are linked together mechanically at the molecular level. 
Structural units (dialkyammonium groups) in dumb-bell-shaped 
organic molecules template the assembly of essentially inorganic 
‘rings’ about ‘axles’ to form rotaxanes consisting of various 
numbers of rings and axles. One of the rotaxanes behaves as a 
‘molecular shuttle’’®: the ring moves between two binding sites on 
the axle in a large-amplitude motion typical of some synthetic 
molecular machine systems''. The architecture of the rotaxanes 
ensures that the electronic, magnetic and paramagnetic character- 
istics of the inorganic rings—properties that could make them suit- 
able as qubits for quantum computers'*"*—can influence, and 
potentially be influenced by, the organic portion of the molecule. 
The basis for the hybrid organic—inorganic rotaxane synthesis lies in 
the observation”? that the formation of heterometallic rings of various 
shapes and sizes, containing seven or more trivalent Cr(1I) ions and 
one or two divalent metal ions (typically Ni(11), Co(1), Fe(m) or Cu(11)) 
bridged by multiple fluoride and alkyl/aryl carboxylate anions, is 
templated by various organic cations”, including imidazolium’, 
N-alkylimidazolium*' and primary and secondary’? ammonium 
groups. Dialkylammonium salts have previously been used to direct 
the assembly of rotaxanes based on crown ethers”, cucurbituril”>”° 
and cyclic peptides’’, so it seemed possible that they could also 
template the formation of rotaxanes using heterometallic rings. 
Secondary amine threads (la—c) were constructed with bulky ‘stoppers’ 
at each end of the axle to prevent subsequent de-threading of the 
heterometallic ring assembled around the ammonium template. The 
threads were reacted’? with a 7:1 molar ratio of Cr(i):Co(1) ions in 
the form of chromium(im) fluoride (CrF3°4H,O) and a cobalt(1) 
pivalate salt ([Co2(H2O)(#BuCO .)4(t#BuCO H),4]; tBu, tert-butyl), 
with pivalic acid as the solvent, at 140 °C for 12h (Fig. la). Although 
no rotaxane was observed with the shorter threads (la and 1b), 
probably owing to there being insufficient space between the stoppers 


to accommodate the bulky heterometallic ring, we found that with a 
six-carbon-atom spacer between the ammonium centre and each 
stopper (thread Ic), [2]rotaxane 2c was produced in 23% yield, which 
is rather efficient for what is essentially a 33-component assembly 
process (each metal—ligand bond is dynamic under the reaction 
conditions). 

Confirmation of the interlocked nature of 2c was provided by the 
solid-state structure ofa single crystal of the [2]rotaxane grown froma 
saturated diethyl ether/acetonitrile solution (Fig. 1b). The [2] rotaxane 
is a neutral molecule, the positive charge on the ammonium thread 
being balanced by the charge remaining on a heterometallic macro- 
cycle consisting of 24 monoanionic ligands (eight fluorides and 
sixteen pivalate groups) bound to seven trivalent cations (Cr(im)) 
and one divalent (Co(1)) cation. The crystal structure shows the 
ammonium cation in the centre of the cavity of the macrocycle, form- 
ing short (~2.0A) F :**HN* hydrogen bonds with two of the 
bridging fluoride ligands. Mass spectrometry confirms that each 
rotaxane molecule contains exactly one cobalt and seven chromium 
ions (Supplementary Information), the crystallographic data indi- 
cating that the cobalt ion is disordered over the eight different metal 
sites in each ring in the solid state ('H NMR suggests that it also has no 
particularly favoured position in solution (see below)). 

The X-ray structure shows the eight metal centres in each hetero- 
metallic wheel to be almost perfectly coplanar, with the two faces of the 
ring almost identical in terms of physical shape and geometry. 
Nevertheless, the wheel is intrinsically chiral (that is, has a non- 
superimposable mirror image) because the one Co(II) ion must have 
either a (A) arrangement or a (A) arrangement of the ligands in its 
octahedral coordination sphere (see Supplementary Information for 
an explanation of this nomenclature). The chirality is particularly 
apparent in the unequal magnetic effect exerted on each geminal 
proton in the methylene groups of the thread as revealed by 'H 
NMR spectroscopy (see below). The sixteen pivalate groups form a 
lipophilic outer coating for the heterometallic ring, conferring good 
solubility of the rotaxane in a range of organic solvents, including 
toluene, dichloromethane, chloroform and diethyl ether. 

Comparison of the 'H NMR spectrum of the parent thread, 1c 
(Fig. 2a), with that of the rotaxane, 2c (Fig. 2b), and a partially 
deuterated analogue, dj44-2¢ (Fig. 2c, d), shows that the basic struc- 
ture found in the solid state is retained in C,D Cl, solution and also 
gives some information regarding the dynamics of the relative 
motion of the mechanically interlocked components. As one of the 
eight metal ions (Co(11)) is different from the other seven (Cr(m)) in 
each ring, if the heterometallic ring spun about the axis of the thread 
rapidly on the NMR timescale then the sixteen pivalate groups should 
experience eight non-equivalent sets of chemical environments— 
four for axial and four for equatorial pivalate groups—as observed 
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Figure 1| Synthesis and X-ray crystal structure of hybrid organic-inorganic 
[2]rotaxane 2c. a, Synthesis. ‘Bu, tert-butyl. b, X-ray crystal structure. In the 
framework representation of the X-ray crystal structure, carbon atoms of the 
thread are shown in grey, those of the pivalate groups in light blue, oxygen 
atoms in red, nitrogen in dark blue, fluorine in green, cobalt in purple and 
chromium in orange. Hydrogen atoms (white) other than those of the 
ammonium group are omitted for clarity. The Co(1) ion is disordered over 
the eight metal sites in each heterometallic wheel. Both enantiomers, (A) or 
(A) at the Co(11) ion, are present in the crystal. Hydrogen-bond lengths and 
N-H-F angles are NH1°**F1 = 2.03 A, NH2°*-F7 = 2.01 A, N-H1- 


F1 = 168.9° and N-H2-F7 = 168.2°, shown as dashed cylinders in the figure. 
The fluorine anion between the two involved in short NH: *:F hydrogen 
bonds is also close enough to both ammonium protons to provide some 
electrostatic stabilization of the intercomponent binding motif (that is, it 
participates in what can be considered long and poorly directional NH°**F 
hydrogen bonds, not depicted in the figure): NH1---F8 = 2.38 A, 
NH2::-F8 = 2.56 A, N-H1-F8 = 114.2°, N-H2-F8 = 101.7°. 
Crystallographic data and experimental details of the structural refinement 
for 2c are provided in the Supplementary Information. 
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Figure 2 | 'H NMR spectra (500 MHz, C,D,Cl,). a, Thread 1¢-PF, at 
330K. b, [2]rotaxane 2c at 330K. ¢€, dy44-2¢ at 330K. d, dj44-2¢ at 300K. 6, 
chemical shift. The eight signals shown in blue in b are due to the 48 pivalate 
methyl groups of the macrocycle (see text and Supplementary Information 
for an explanation of the splitting pattern). Per-deuterating the pivalate 


methyl groups facilitates characterization of the relatively low-intensity 
thread protons in the rotaxane (¢ and d). The rotaxane signals below 

—5 p.p.m. are shown at X7 magnification and on a compressed chemical- 
shift axis in comparison with the signals above —5 p.p.m. Residual solvent 
peaks and impurities are shown in grey. 
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for the parent macrocycle [Cr7CoFs(t#BuCO2)156*H30] (Supple- 
mentary Information). If, on the other hand, the ring were rotating 
slowly about the thread then it would become desymmetrized on the 
NMR timescale and many more non-equivalent sets of signals for the 
pivalate groups would be expected. Eight resonances for pivalate 
methyl groups (blue signals, Fig. 2b) are present for the [2]rotaxane 
at 330K in C,D,Cl, (500 MHz), and as cooling to 240K does not 
change this number, nor appreciably broaden the signals, it appears 
that the heterometallic ring rotates about the thread (randomly 
under thermal motion) very rapidly under these conditions. 

The most dramatic effects of interlocking the inorganic and organic 
components, however, are the very large shifts (up to 45 p.p.m.; 
Fig. 2d) in the thread protons caused by the paramagnetic Co(t) 
ion, the greatest shifts generally occurring for the protons closest to 
the heterometallic wheel. The presence of two signals for each pair of 
methylene protons is due to the two protons of each methylene group 
(labelled primed and unprimed in Fig. 2) being diastereotopic (that is, 
magnetically distinct) given the chirality of the heterometallic wheel. 
The magnitude of the shifts is highly sensitive to changes in tempe- 
rature (for example, there is a 6—8-p.p.m. change in the chemical shift 
of H, and Hy over a range of just 30 K; compare Fig. 2c with Fig. 2d). 

The hybrid organic—inorganic rotaxane synthesis was successfully 
extended to threads containing more than one ammonium binding site 
(Fig. 3). This enabled the preparation of significantly more complex 
architectures (Fig. 4), including multi-ring [3]- and [4]rotaxanes and 
dynamic molecular shuttles in which the ring moves randomly under 
thermal motion between different binding sites on the thread. 

With a long (12-carbon-atom) spacer between two ammonium 
groups on the thread (3b), reaction under the previously established 
conditions (7:1 CrF3°4H,O:[Co(H,O)(tBuCO,)4(t#BuCO H),], 
t-BuCO>H, 140 °C, 12h) afforded [3]rotaxane 5b in 45% yield (from 
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t tay CuCO,/Cu(OH) 
Bu Bu 
ONO eee 
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'BuCO,H (solvent), 140 °C, 12-16 h 
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‘Bu 6b, n = 8, 37% 


Figure 3 | Synthesis of hybrid organic-inorganic 

[3]rotaxane 5b, [4]rotaxane 6b and molecular shuttle 4a. Compound 

5b has two rings on one thread, 6b has two rings each circumscribing two 
threads and 4a is a [2]rotaxane in which the ring moves randomly between 
the two ammonium sites on the thread, restricted by the tight ion-pairing 
and intercomponent NH-°::F hydrogen-bonding. 
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Figure 4 | X-ray crystal structures. a, [3]rotaxane 5b. b, A pseudorotaxane 
featuring the bisammonium binding-site arrangement present in molecular 
shuttle 4a. c, [4]rotaxane 6b. The atom representations are as indicated in 
the legend of Fig. 1b, except that here the organic threads are shown at van 
der Waals radii and the metal ions of the macrocycles are depicted by solid 
polyhedra reflecting their coordination geometries: Cr(im), orange 
octahedron; Co(11), purple octahedron; Cu(i1), blue square pyramid. Details 
of each set of intercomponent binding motifs are provided in the 
Supplementary Information. In a and b the divalent metal ion, Co(1), is 
disordered over the eight metal sites on each heterometallic wheel (as for 
[2]rotaxane 2c; Fig. 1b); in c the divalent metal ion, Cu(1), is localized at the 
positions indicated. The crystallographic data and experimental details of 
the structural refinement for 4a, 5b and 6b are provided in the 
Supplementary Information. 


65 components), without any significant quantity (<1%) of the 
corresponding [2]rotaxane. The X-ray structure (Fig. 4a) of a single 
crystal of 5b grown from a saturated solution in tetrahydrofuran/ 
acetonitrile shows types of intercomponent interactions similar to those 
observed with [2]rotaxane 2c, although at one of the macrocycle-thread 
binding sites the four NH°**F contacts are of such similar lengths that 
they can be considered to be pairs of bifurcated F-+**H(N‘H)***F~ 
hydrogen bonds (Supplementary Information). 

Notably, changing the divalent metal from Co(11) to Cu(m) led to a 
[4]rotaxane, 6b, with two thread molecules circumscribed by two 
heterometallic macrocycles**, each ring containing ten octahedral 
Cr(im) ions and two square-pyramidal Cu(i1) ions. By modifying 
the ratio of the trivalent:divalent metal-salt reagents from 7:1 to 
the 5:1 required for the stoichiometry of this product, we isolated 
[4]rotaxane 6b in 37% yield (from 98 components: 24 metal ions, 44 
pivalate groups, 28 fluoride ions and two organic threads). The X-ray 
structure (Fig. 4c) of a single crystal of 6b grown from a saturated 
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solution of the [4]rotaxane in diethyl ether and acetonitrile was 
solved with data collected using the Advanced Light Source synchro- 
tron, Lawrence Berkeley National Laboratory. The different coordi- 
nation requirements of Cu(m) (five coordinate) compared with 
Co(11) (six coordinate) leads to some enforced changes in the ligand 
arrangement for the larger heterometallic wheel, and in the 
[4]rotaxane two of the 14 fluoride ligands are bound to only one 
metal ion rather than bridging two. These two singly coordinated 
fluoride anions have a central role as hydrogen-bond acceptors in the 
intercomponent binding motifs of the [4]rotaxane in the solid state, 
each being involved in bifurcated hydrogen-bonding to two ammo- 
nium groups on different threads (Supplementary Information). 

By shortening the spacer between the ammonium units on the 
thread from twelve (3b) to six methylene groups (3a)—which is 
too short to allow heterometallic rings to reside simultaneously on 
both ammonium sites—and adding (chloromethyl)triethylammo- 
nium chloride to the reaction, we were able to form a [2]rotaxane 
with one heterometallic ring on a thread with two ammonium 
groups, 4a, in 33% yield (Fig. 3). The (chloromethyl)triethylammo- 
nium chloride was originally introduced accidentally to the reaction 
as an impurity, produced by the alkylation of triethylamine by 
dichloromethane during isolation of the thread by column chro- 
matography. However, when subsequent reactions using pure 
samples of the thread failed to produce any of rotaxane 4a, analysis 
of the original procedure established the presence of (chloromethyl)- 
triethylammonium chloride in the successful rotaxane-forming 
experiment. When (chloromethyl)triethylammonium chloride was 
deliberately added to the pure thread, the reaction once again 
produced rotaxane 4a. Other tetraalkylammonium salts, such as 
tetraethylammonium chloride, did not promote rotaxane formation 
when added to the reaction mixture. This suggests that the chloro- 
methyl group has a crucial role, possibly as an electrophilic site of 
temporary attachment for the tetraalkylammonium cation, which 
might then act as a macrocyclization template for a growing oligo- 
meric metal-pivalate-fluoride chain. 

Unusually for a hydrogen-bonded molecular shuttle, the random 
back-and-forth shuttling of the macrocycle between the two binding 
sites on the thread in 4a occurs on a very different timescale from the 
spinning of the ring about the axis of the thread. At 300K, the 500- 
MHz 'H NMR spectrum of 4a in a range of solvents (CDCI, 
C,D2Cl,, dg-tetrahydrofuran) shows a_ well-resolved pair of 
signals for each chemically distinct ‘stopper’ proton (Supple- 
mentary Information), the difference within each pair being caused 
by whether the adjacent ammonium centre is vacant or occupied by 
the heterocyclic ring, indicating that the shuttling between them is 
slow on the NMR timescale. Heating to 360 K (C,D Cl as solvent) 
did not significantly broaden, let alone coalesce, the signal pairs. 
However, the dynamic exchange of the ring between the two thread 
ammonium sites could be measured by polarization transfer between 
signal pairs during the mixing time in two-dimensional exchange 
spectroscopy experiments” (Supplementary Information). These 
measurements gave a rate of exchange of 1.2+0.5s_' at 330K in 
C,D2Cl,, corresponding to a free energy of activation (AG*) for 
shuttling of 19.3 + 0.2 kcal mol” '. This value is at least (and possibly 
much more than) 10 kcal mol! higher than the activation barrier to 
spinning of the ring. 

This significant difference can be rationalized from the intercom- 
ponent NH:::F hydrogen-bonding seen in the various crystal struc- 
tures (Figs 1b and 4). During spinning, the pseudo-Cg-symmetry of 
the bridging fluoride ions in the wheel enables the NH—F hydrogen 
bonds to be transferred from one fluoride ion to the next by a low- 
energy route that does not require significant breaking of an existing 
hydrogen bond before a new one starts to form, that is, conversion 
from a ‘short’ NH°:'F hydrogen bond to a ‘long’ one through a 
bifurcated system. These motifs are all seen in the X-ray structures 
of the various rotaxanes. By contrast, any shuttling mechanism for 4a 
must require complete breaking of all the intercomponent NH°*-F 
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hydrogen bonds before translocation of the wheel to the other 
ammonium binding site can occur. 

The linking of organic and inorganic components at the molecular 
level by mechanical bonding imparts features of both chemistries (for 
example dynamics and magnetism) to the overall structure, and 
introduces new and intriguing behaviour through the combination 
of the two (for example the differences in the mechanisms of rota- 
tional and translational intercomponent motion, new non-covalent 
binding motifs, extreme temperature sensitivity of chemical shifts, 
novel types of template mechanism and so on). The provision of a 
simple and versatile synthetic route for linking inorganic and organic 
components by mechanical bonds at the molecular level should make 
it possible to create molecules with composite properties, including 
new behaviours and characteristics that may arise by combining 
features that have previously been solely the preserve of one type of 
chemistry or the other. 
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1. Yaghi, O. M. et al. Reticular synthesis and the design of new materials. Nature 423, 
705-714 (2003). 
2. Cheetham, A. K. & Rao, C. N. R. There’s room in the middle. Science 318, 58-59 
(2007). 
3. Kitagawa, S., Kitaura, R. & Noro, S.-i. Functional porous coordination polymers. 
Angew. Chem. Int. Edn 43, 2334-2375 (2004). 
4. Ogino, H. Relatively high-yield syntheses of rotaxanes. Syntheses and properties 
of compounds consisting of cyclodextrins threaded by «, w-diaminoalkanes 
coordinated to cobalt(IIl) complexes. J. Am. Chem. Soc. 103, 1303-1304 (1981). 
5. Batten, S. R. & Robson, R. Interpenetrating nets: ordered, periodic entanglement. 
Angew. Chem. Int. Edn 37, 1460-1494 (1998). 
6. Sauvage, J.-P. & Dietrich-Buchecker, C. (eds). Molecular Catenanes, Rotaxanes and 
Knots: A Journey through the World of Molecular Topology (Wiley-VCH, 1999). 
7. Loeb, S.J. Metal organic rotaxane frameworks. Chem. Commun. 1511-1518 (2005). 
8. Fujita, M., lbukuro, F., Hagihara, H. & Ogura, K. Quantitative self-assembly of a 
[2]catenane from two preformed molecular rings. Nature 367, 720-723 (1994). 
9. Alam, M.A. et al. Directed 1D assembly of a ring-shaped inorganic nanocluster 
templated by an organic rigid-rod molecule. Angew. Chem. Int. Edn 47, 
2070-2073 (2008). 

O. Anelli, P.L., Spencer, N. & Stoddart, J. F. A molecular shuttle. J. Am. Chem. Soc. 113, 
5131-5133 (1991). 

1. Berna, J. et al. Macroscopic transport by synthetic molecular machines. Nature 
Mater. 4, 704-710 (2005). 

2. Nguyen, T. et al. A reversible molecular valve. Proc. Natl Acad. Sci. USA 102, 
10029-10034 (2005). 

3. Liu, Y. et al. Linear artificial molecular muscles. J. Am. Chem. Soc. 127, 9745-9759 
(2005). 

4. Green, J. E. et al. A 160-kilobit molecular electronic memory patterned at 10" bits 
per square centimetre. Nature 445, 414-417 (2007). 

5. Kay, E.R., Leigh, D. A. & Zerbetto, F. Synthetic molecular motors and mechanical 
machines. Angew. Chem. Int. Edn 46, 72-191 (2007). 

6. Leuenberger, M. N. & Loss, D. Quantum computing in molecular magnets. Nature 
410, 789-793 (2001). 

7. Bertaina, S. et al. Quantum oscillations in a molecular magnet. Nature 453, 
203-206 (2008). 

8. Winpenny, R. E. P. Quantum information processing using molecular 
nanomagnets as qubits. Angew. Chem. Int. Edn 47, 7992-7994 (2008). 

9. Larsen, F. K. et al. Synthesis and characterization of heterometallic {Cr7M} 
wheels. Angew. Chem. Int. Edn 42, 101-105 (2003). 

20. Affronte, M., Carretta, S., Timco, G. A. & Winpenny, R. E. P. A ring cycle: studies of 
heterometallic wheels. Chem. Commun. 1789-1797 (2007). 

21. Timco, G. A. et al. Influencing the nuclearity and constitution of heterometallic 
rings via templates. Chem. Commun. 3649-3651 (2005). 

22. Cador, O. et al. The magnetic Mobius strip: synthesis, structure and magnetic 
studies of odd-numbered antiferromagnetically coupled wheels. Angew. Chem. 
Int. Edn 43, 5196-5200 (2004). 

23. Kolchinski, A. G., Busch, D. H. & Alcock, N. W. Gaining control over molecular 
threading: benefits of second coordination sites and aqueous-organic interfaces 
in rotaxane synthesis. J. Chem. Soc. Chem. Commun. 1289-1291 (1995). 

24. Ashton, P. R. et al. Self-assembling [2]- and [3]rotaxanes from secondary 
dialkylammonium salts and crown ethers. Chem. Eur. J. 2, 729-736 (1996). 

25. Mock, W.L., Irra, T. A., Wepsiec, J. P. & Adhia, M. Catalysis by cucurbituril. The 
significance of bound-substrate destabilization for induced triazole formation. J. 
Org. Chem. 54, 5302-5308 (1989). 

26. Kim, K. Mechanically interlocked molecules incorporating cucurbituril and their 
supramolecular assemblies. Chem. Soc. Rev. 31, 96-107 (2002). 

27. Aucagne, V., Leigh, D. A., Lock, J.S. & Thomson, A. R. Rotaxanes of cyclic peptides. 
J. Am. Chem. Soc. 128, 1784-1785 (2006). 

28. Ashton, P. R. et al. Doubly encircled and double-stranded pseudorotaxanes. 
Angew. Chem. Int. Edn Engl. 34, 1869-1871 (1995). 


317 


©2009 Macmillan Publishers Limited. All rights reserved 


LETTERS 


29. Perrin, C. L. & Dwyer, T. J. Application of two-dimensional NMR to kinetics of 
chemical exchange. Chem. Rev. 90, 935-967 (1990). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank J. Bella for the exchange spectroscopy NMR 
experiments, W. Sun for assistance with the preparation of thread 1¢ and the 
Engineering and Physical Sciences Research Council (EPSRC) National Mass 
Spectrometry Service Centre (Swansea, UK) for high-resolution mass 
spectrometry. This research was funded by the European Commission (through 
the NoE ‘MAGMANet’) and EPSRC. The Advanced Light Source is supported by 
the Director, Office of Science, Office of Basic Energy Sciences, of the US 
Department of Energy under contract no. DE-ACO2-05CH11231. D.S. is a Swiss 
National Science Foundation postdoctoral fellow. D.A.L. is an EPSRC Senior 
Research Fellow and holds a Royal Society Wolfson Research Merit Award. 


318 


NATURE|Vol 458|19 March 2009 


Author Contributions C.-F.L., D.S. and G.A.T. carried out the synthesis and 
characterization studies, helped plan the experiments and participated in the 
preparation of the manuscript. R.G.P. and S.J.T. collected the X-ray data and solved 
he crystal structures. D.A.L. and R.E.P.W. helped plan the experiments and prepare 
he manuscript. 


Author Information The crystallographic data and experimental details of the 
structural refinement for the X-ray crystal structures reported in this paper have 
been deposited at the Cambridge Crystallographic Data Centre, under deposition 
numbers CCDC 705132-CCDC 705135. These data can be obtained free of charge 
rom the Cambridge Crystallographic Data Centre (http://www.ccdc.cam.ac.uk/ 
data_request/cif). Reprints and permissions information is available at 
www.nature.com/reprints. Correspondence and requests for materials should be 
addressed to D.A.L. (david.leigh@ed.ac.uk) or R.E.P.W. 
(richard.winpenny@manchester.ac.uk). 


©2009 Macmillan Publishers Limited. All rights reserved 


Vol 458|19 March 2009|doi:10.1038/natureO7818 


nature 


LETTERS 


Temperature-dependent thermal diffusivity of the 
Earth's crust and implications for magmatism 


Alan G. Whittington’, Anne M. Hofmeister” & Peter I. Nabelek’ 


The thermal evolution of planetary crust and lithosphere is largely 
governed by the rate of heat transfer by conduction’. The govern- 
ing physical properties are thermal diffusivity (x) and conductivity 
(k= KpCp), where p denotes density and Cp denotes specific heat 
capacity at constant pressure. Although for crustal rocks both x and 
k decrease above ambient temperature*’, most thermal models of 
the Earth’s lithosphere assume constant values for « (~1 mm/’s_') 
and/or k (~3 to 5Wm__' K~')®*” owing to the large experimental 
uncertainties associated with conventional contact methods at high 
temperatures. Recent advances in laser-flash analysis*” permit 
accurate (+2 per cent) measurements on minerals and rocks to 
geologically relevant temperatures’. Here we provide data from 
laser-flash analysis for three different crustal rock types, showing 
that « strongly decreases from 1.5-2.5 mm/s ' at ambient condi- 
tions, approaching 0.5mm’ s~' at mid-crustal temperatures. The 
latter value is approximately half that commonly assumed, and hot 
middle to lower crust is therefore a much more effective thermal 
insulator than previously thought. Above the quartz a-f phase 
transition, crustal x is nearly independent of temperature, and 
similar to that of mantle materials''. Calculated values of k indicate 
that its negative dependence on temperature is smaller than that of 
K, owing to the increase of Cp with increasing temperature, but k 
also diminishes by 50 per cent from the surface to the quartz o-f 
transition. We present models of lithospheric thermal evolution 
during continental collision and demonstrate that the temperature 
dependence of « and Cp leads to positive feedback between strain 
heating in shear zones and more efficient thermal insulation, 
removing the requirement for unusually high radiogenic heat 
production to achieve crustal melting temperatures. Positive feed- 
back between heating, increased thermal insulation and partial 
melting is predicted to occur in many tectonic settings, and in 
both the crust and the mantle, facilitating crustal reworking and 
planetary differentiation”. 

Thermal diffusivity data for garnet schist, leucogranite and welded 
rhyolitic ash-flow tuff were acquired at temperatures of up to 1,260 K 
using laser flash analysis, which isolates the phonon component of 
heat transfer from radiative transfer and avoids thermal contact 
losses’. Graphite-coated wafers approximately 1-mm thick and 
10mm in diameter are held at some temperature in a furnace. 
Thermal diffusivity is determined from the time-dependent response 
of the sample to an additional pulse of heat, supplied by a gadolinium— 
gallium—garnet laser to the bottom of the sample, which is monitored 
using an indium antimonide detector above the sample. The leuco- 
granite and rhyolite are homogeneous and isotropic. The schist 
is anisotropic owing to alternating mica- and quartz-rich layers, so 
samples were cut parallel and perpendicular to foliation. 

Room-temperature (~298 K) thermal diffusivities ranged from 
1.3 to 1.8mm/’s ! for the schist, from 1.5 to 2.1mm/’s ! for the 


granite and from 1.8 to 2.2mm7s_' for the rhyolite (Fig. 1). For 
schist slices cut perpendicular to foliation, heat flow travels parallel 
to foliation and averages alternating compositional layers; these per- 
pendicular slices yielded values that are intermediate between values 
for quartz-rich (high-«) and mica-rich (low-x) layers sampled by 
cuts parallel to foliation (when heat flow is across each layer). 
Variations in « among different granite and rhyolite samples were 
also ascribed to differences in the amount of quartz sampled by the 
laser spot, which is 6 mm in diameter. 

For all samples, « decreased rapidly with increasing temperature, 
and asymptotically approached a high-temperature limit. Variations 
among rocks, and among different samples of the same rock, 
decreased markedly between room temperature and 600 K. Above 
~750K, the « of the schist samples became very low, probably as a 
result of differential thermal expansion and cracking along cleavage 
planes, perhaps accompanied by mica dehydration. The data suggest 
a break in slope at the quartz af transition temperature, followed by 
a slight, quasi-linear decrease in k above 846K. This trend is also 
observed in quartzites™. 

Data for the two rhyolite samples, two granite samples and the 
perpendicular schist were fitted by the following equations, which 
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Figure 1| Thermal diffusivity of crustal rocks as a function of temperature. 
Data are given in Supplementary Information. Uncertainties are smaller 
than symbol size in most cases. Curve for average crust calculated using 
equations (1) and (2) below and above the quartz o—[ transition, 
respectively. Curves for KAISi,;Og glass and liquid are from ref. 28. para, 
parallel; perp, perpendicular. 
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should be a good estimate of the temperature dependence of thermal 
diffusivity in the continental crust: 


Kerust(  < 846K) = 567.3/T —0.062 (1) 


Kerust(T > 846K) = 0.732 —0.000135T (2) 


Here xk is measured in square millimetres per second and T (temper- 
ature) is measured in kelvin. Crystalline garnet, pyroxene and olivine 
all have low average « values in the range 0.6-0.7 mm’ s_! at upper- 
mantle temperatures'®'""’, so any change in xk across the crust—-mantle 
boundary is likely to be minor. Rather, it is within the middle and 
upper crust that « changes significantly, by a factor of 3-5. 

To calculate thermal conductivity, we derived the following ‘bulk 
crustal’ specific heat capacity equations from end-member mineral 
data’®, on the basis of a mineralogy of 30% quartz, 60% albite, 5% 
phlogopite and 5% annite: 


Cp crust(T <846K) = 199.50 +0.0857T —5.0x 10°-°T~* (3) 


Cp crust(T > 846K) = 229.32 + 0.0323T —47.9x 10~°T~? (4) 


Here Cp is measured in joules per mole per kelvin, the average molar 
mass is 221.78gmol' and T is measured in kelvin. Although the 
precise values of Cp depend on modal mineralogy, calculated differe- 
nces in Cp among several possible assemblages such as tonalite, granite 
and schist are <3% at 1,000 K. We assumed a constant density of 
2,700kg m7 *, because the increase in density due to compression is 
partly off-set by the decrease due to higher temperatures, and these 
changes are small in comparison with changes in Cp and k. 

Calculated values for crustal k decrease with increasing temper- 
ature, from ~3.8Wm !K ! at the surface to ~1.9Wm 'K ‘at 
the quartz o-f transition, and then decrease only very slightly at 
higher temperatures (Fig. 2). Our calculations suggest that the k of 
the lower crust is 25% lower than previous estimates’’, although this 
depends in part on assumed mineralogy. At 1,000 K, the k of olivine'” 
is 3.0Wm 'K~', which is 60% higher than that of the crust. 
Therefore, the middle to lower crust is the most thermally insulating 
portion of continental lithosphere. 

Negative temperature derivatives of « and k may lead to a positive 
feedback between temperature increase and heat retention due to 
decreased thermal diffusivity, in the presence of a heat source. 
Potential heat sources for continental crust include heat flux from 
the mantle, radioactive decay, strain heating in deforming rocks and 
intrusion of mantle-derived basaltic magma. Collisional orogenic 
belts such as the Himalayas do not derive heat from basalts, yet 
crustal melting and the generation of leucogranites is a universal 
feature of such orogens'®. The mechanism for achieving this has long 
been debated'’, with most models favouring either unusually high 
radiogenic heat production in the middle crust”’”' or strain heating 
in major shear zones. 

To test the effect of a temperature-dependent « on the thermal 
structure of orogenic belts, we used the finite-difference code 
OROGEN” to model the one-dimensional thermal evolution of a 
doubly thickened crust. The code was modified to incorporate the 
temperature-dependent « and Cp from equations (1), (2), (3) and 
(4), and solves the heat flow equation: 


OT _ 6 0T | AnatAs (5) 
at Oz oz pts 


The initial conditions assume stacking of 35-km-thick crust giving the 
initial sawtooth geotherm. A vertical grid spacing of 250 m was used. 
The models included radiogenic heat production (A,.q) as follows: Arag 
was 21Wm> at the surface and reduced exponentially with depth, 
with a drop-off length of 15 km, to 35 km. Between depths of 35 and 
70km, A,aq Was constant at 0.2 w4W_m°, and between 70 and 130km 
it was constant at 0.02Wm_~’. For the mantle, the temperature- 
dependent values of Cp and x were assumed to be those of olivine. 
Strain heating (A,) occurred within a 3-km-wide shear zone at a depth 


320 


NATURE|Vol 458|19 March 2009 


i) 
° 
ed 
fo) 


x 
a 
ak 
a 


Thermal diffusivity (mm? s-) 
° 
a 


= 
(| ,-6 p) Ayoedeo yeay oyloads 


° 
fo) 


500 750 1,000 
Temperature (K) 


1,250 


Thermal conductivity (W m-1 K-1) 


Upper crust 
(ref. 17) 


KAISi,0, glass 


250 450 650 850 
Temperature (K) 


1,050 1,250 


Figure 2 | Thermal conductivity as a function of temperature. Curve for 
average crust calculated using equations (1), (2), (3) and (4) and a density of 
2,700 kgm *. Curve for olivine calculated from ref. 13. Curves for upper and 
lower crust calculated from ref. 17; dotted curves include pressure 
dependence of k, assuming a geotherm of 15 °Ckm_', and solid curves 
ignore pressure dependence. Curves for KAISi;Og glass and liquid are from 
ref. 28. Inset, contrasting temperature dependence of thermal diffusivity and 
specific heat capacity, calculated for average crust from equations (1), (2), 
(3) and (4). 


of 35 km. The rate of strain heating was given by tv/z where t (shear 
stress) is 30 MPa, v (thrusting velocity) is 3 cm yr ' and z denotes the 
characteristic width of the shear zone. The effect of strain heating 
decreased in a Gaussian fashion with distance from the centre of the 
shear zone (dz) according to exp(—dz I2). Temperature was fixed at 
1,300 °C at the bottom of 130-km-thick lithosphere. Thrusting was 
assumed to occur for 40 Myr, followed by 40 Myr of thermal relaxation. 

Models with a constant « of 1 mm?’ s"! (Fig. 3a) show that the schist 
solidus'*”® is not reached during 40 Myr of thrusting, and thermal 
relaxation towards a steady-state geotherm after thrusting does not 
result in attainment of the temperatures required for crustal melting. 
Incorporating a temperature-dependent x accelerates the effect of 
strain heating because heat does not diffuse efficiently away from 
the hot shear zone (Fig. 3b). In this model, the schist solidus is 
reached after 40 Myr. Models with constant radiogenic heat produc- 
tion throughout the lithosphere, for example of 0.1 1Wm°* (not 
shown), produce straighter steady-state geotherms, but the effect of 
a temperature-dependent x on strain heating is similar. 

Our models indicate that the high degree of thermal insulation 
provided by hot rocks leads to retention of heat generated by strain 
heating, and that unusually high radiogenic heat production at mid- 
crustal levels is not required for melting in orogenic belts. We 
emphasize that the amounts of radiogenic heating used in our models 
are conservative, and that higher values will lead to earlier and more 
widespread crustal melting. Moreover, because thermal diffusion 
through the hot lithospheric mantle and lower crust is slow, there 
is a more pronounced curvature to the steady-state geotherm with 
more potential for melting in the lower crust. 

Once melting begins, rock strength decreases and the strain-heating 
mechanism should cease to be effective’”. However, the k of KAISi;O 
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Figure 3 | Thermal models for doubly thickened 
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glasses and melts is ~20% lower than that of KAISi,Og crystals”’, the k 
of KAISi;Og glasses and melts is 15-20% lower than that of the bulk 
crust (Fig. 2), and dissolved water also decreases x (ref. 29), meaning 
that hydrous granitic liquids should be particularly efficient insulating 
materials. The onset of crustal melting may therefore lead to the 
production of a more insulating layer than the surrounding crust. 
This positive feedback between melting and thermal insulation may 
promote increased melt fraction, and allows strain heating to play a 
significant role in triggering crustal anatexis even though it must 
become negligible once melting is achieved. 

The results of our study pertain to other settings in which partial 
melting occurs. In both arc and intraplate volcanic settings, basaltic 
intrusions supply the heat necessary for crustal melting and produc- 
tion of voluminous silicic magmas. Thermal modelling shows that 
the low thermal diffusivity of hot crust reduces both the long-term 
flux of mafic magma needed to produce the observed quantities of 
silicic magma, and the incubation time between the onset of basalt 
injection and the first eruption of rhyolites*®. We find values of « for 
crustal rocks that are even lower than used in these models, which 
should result in shorter calculated incubation times. Furthermore, 
the positive-feedback relations between heating, increased thermal 
insulation and partial melting will also occur in the mantle, facilitat- 
ing basaltic magmatism and planetary-scale differentiation. 
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Obliquity-paced Pliocene West Antarctic ice sheet 


oscillations 
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M. Taviani?’, S. Vogel’, T. Wilch?® & T. Williams”? 


Thirty years after oxygen isotope records from microfossils deposited 
in ocean sediments confirmed the hypothesis that variations in the 
Earth’s orbital geometry control the ice ages', fundamental questions 
remain over the response of the Antarctic ice sheets to orbital cycles’. 
Furthermore, an understanding of the behaviour of the marine- 
based West Antarctic ice sheet (WAIS) during the ‘warmer- 
than-present’ early-Pliocene epoch (~5-3 Myr ago) is needed to 
better constrain the possible range of ice-sheet behaviour in the 
context of future global warming’. Here we present a marine glacial 
record from the upper 600 m of the AND-1B sediment core recovered 
from beneath the northwest part of the Ross ice shelf by the 
ANDRILL programme and demonstrate well-dated, ~40-kyr cyclic 
variations in ice-sheet extent linked to cycles in insolation influenced 
by changes in the Earth’s axial tilt (obliquity) during the Pliocene. 
Our data provide direct evidence for orbitally induced oscillations in 
the WAIS, which periodically collapsed, resulting in a switch from 
grounded ice, or ice shelves, to open waters in the Ross embayment 
when planetary temperatures were up to ~3 °C warmer than today* 
and atmospheric CO, concentration was as high as ~400 p.p.m.v. 
(refs 5, 6). The evidence is consistent with a new ice-sheet/ice-shelf 
model’ that simulates fluctuations in Antarctic ice volume of up to 
+7 m in equivalent sea level associated with the loss of the WAIS and 
up to +3 m in equivalent sea level from the East Antarctic ice sheet, in 
response to ocean-induced melting paced by obliquity. During inter- 
glacial times, diatomaceous sediments indicate high surface-water 
productivity, minimal summer sea ice and air temperatures above 
freezing, suggesting an additional influence of surface melt* under 
conditions of elevated CO}. 


The Earth’s climate system during the Pliocene and early- 
Pleistocene epochs was regulated by a ~40-kyr periodicity. The geo- 
logical evidence for this is widespread and expressed in polar to 
equatorial depositional environments including (1) ice volume from 
oxygen isotope (5'%O) records, that co-vary with the pattern of ice- 
rafted debris in deep-sea sediments’; (2) ocean circulation’? and 
temperature also inferred from deep-sea sediment proxies'””’; (3) 
atmospheric circulation from continental dust deposits’’; and (4) 
global sea-level fluctuations recorded in the shallow-marine continental 
margins". 

The 40-kyr cycle is almost certainly linked to variations in the 
Earth’s orbital obliquity. However, the specific nature of this forcing 
and its influence on Antarctic glaciation remain unresolved owing to a 
lack of well-dated climate records that directly sample past oscillations 
of the ice sheet. The new AND- 1B core provides such a record (Fig. 1). 
In this Letter, we focus on the early Pliocene (~5—3 Myr ago) part of 
the record, because for this period the response of Antarctic ice sheets 
to orbital forcing can be studied without the complicating influence of 
large Northern Hemisphere ice sheets on sea-level and deep-sea 5'°O 
records’. Furthermore, polar ice-sheet boundary conditions were 
similar to today, but the climate was warmer*”®. With anthropogenic 
warming projected to rise an average of ~3 “Cin mean temperature by 
the end of the twenty-first century, more significance is being placed 
on the early Pliocene as an analogue for understanding the future 
behaviour of the WAIS* and its contribution to global sea level’®. 
Far-field geological evidence for palaeoshorelines up to 25m above 
present'”"* are consistent with ice-volume estimates from deep-ocean 
5'%O data!’, and imply deglaciation of the Greenland ice sheet, the 
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Figure 1| Location of the ANDRILL McMurdo Ice Shelf Project AND-1B drill 
site in the northwestern corner of the Ross ice shelf. Also shown are the 
locations of previous Deep Sea Drilling Project (DSDP), Integrated Ocean 
Drilling Program (IODP) and Ocean Drilling Program (ODP) cores, Plio- 
Pleistocene cores, percentage recovery (from shelf sites) and geographic 
features including ice-sheet configuration and flow lines. (Supplementary 
Figs 1 and 2 show more detail of the glaciologic and geologic setting). The 
dashed contour indicates a depth of 1,000 m below sea-level. 


WAIS and the marine margins of the East Antarctic ice sheet (EAIS) 
during the warmest early Pliocene interglacials. 

Thirty-eight glacimarine cycles, each bounded by glacial surfaces 
of erosion (GSEs), occur in the upper 600 m of the core, and record 
oscillations in the extent of an ice sheet in Ross embayment during 
the past 5 Myr (Fig. 2)*°. The drilled strata accumulated in the rift axis 
of the Victoria Land basin, ~100km seaward of the coast. 
Accommodation space for the preservation of the sediments, and 
their protection from subsequent glacial erosion, was provided by 
high rates of tectonic subsidence due to a unique combination of 
regional rift extension and flexure of the crust by local volcanic 
islands (Supplementary Information). 

Figures 3 and 4 summarize our interpretations of individual sedi- 
mentary cycles in terms of the vertical occurrences of lithofacies, that 
is, sediments representing specific environments of deposition. These 
range from marine-diatom-rich deposits and mudstones deposited 
during interglacials to ice-proximal diamictites, conglomerates and 
sandstones representing glacial periods. During glacial periods, the ice 
sheet had a laterally extensive marine terminus extending well beyond 
the drill site, out into the Ross Sea. During interglacials, the drill site 
was either covered by an ice shelf or, when the ice sheet retreated, lay in 
open water. The sedimentary characteristics of the cycles and the 
approach used for their interpretation are given in more detail 
elsewhere! (Methods). We note here that changes in lithofacies 
through time primarily reflect the proximity of the ice-sheet ground- 
ing zone and the thermal characteristics of the depositing ice (Fig. 2 
and Supplementary Information). Such inferences are consistent with 
depositional models from a variety of different glacimarine 
regimes’*”*, and permit the identification of 38 oscillations in the 
extent of the ice sheet’s grounding line. 

The composition of till (ice-contact diamictites), which overlie 
GSEs and represent sediments transported at the base of a grounded 
ice sheet, show that the ice originated from large outlet glaciers in the 
Transantarctic Mountains (TAM), especially the Mulock and Skelton 
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glaciers south of Minna Bluff’. A new continental Antarctic ice-sheet 
model’ run for the past 5 Myr supports geological interpretations 
that the provenance of grounded ice at the AND-1B site is always 
from nearby southern-TAM outlet glaciers during glacial advances 
(Fig. 2). 

Notably, the model finds that local ice variations at the AND-1B 
site are indicative of the overall West Antarctic glacial state, because 
both are controlled by variations in ocean-induced melt. When 
open-water marine sediments occur in the AND-1B core, the model 
shows not only deglaciation in the western Ross embayment, but also 
the collapse of the entire WAIS (Fig. 2). Pliocene—Pleistocene varia- 
tions in ice volume are dominated by large WAIS advances and 
retreats, while the high-altitude regions of the EAIS remain relatively 
stable. This is because air temperatures never become warm enough 
to cause significant surface melting on the EAIS™, whereas variations 
in ocean-induced melt and sea level affect the marine-based WAIS 
much more than the EAIS. Thus, the sedimentary cycles in the AND- 
1B core both track local variations of the coastal margin of the EAIS 
(for example TAM outlet glaciers) and provide physical evidence for 
major changes in the mass balance of the WAIS (Supplementary 
Information). 

Figure 2 illustrates the stratigraphic position of 26 chronological 
datums that are used to constrain the age and duration of the 38 sedi- 
mentary cycles and identify the time missing at cycle-bounding erosion 
surfaces, that is, unconformities (Supplementary Table 1). The chro- 
nology is developed from *°Ar/**Ar ages of volcanic deposits and a 
quantitative diatom biostratigraphy used to constrain the correlation 
between the magnetic polarity stratigraphy and the geomagnetic 
polarity timescale*'. The approach used, and associated uncertainties, 
are outlined in Methods. About 36% of the last 5 Myr is represented as 
rock in the AND-1B core; the rest is lost at unconformities resulting 
from erosion through long-term tectonic influences and shorter-term 
volcanic and glacial processes. Chronostratigraphic constraints enable 
identification of two types of unconformities: (1) those where the time 
missing is longer than a Milankovitch cycle, interpreted as major 
erosion due to tectonic influences and/or a major phase of glacial 
advance; and (2) those of suborbital duration reflecting lesser glacial 
erosion associated with a single glacial advance truncating only part of 
the previous cycle. The chronology also allows the duration of relatively 
continuous stratigraphic packages comprising more than one cycle to 
be estimated with sufficient precision for the recognition of orbital 
periods” (Methods). 

One such interval is illustrated in detail in Fig. 3 and comprises six 
early-Pliocene glacial cycles (cycles 38-33), spanning 265 kyr. In this 
case, the identification of three palacomagnetic reversal boundaries 
allows one-to-one matching of the WAIS grounding-line oscillations 
recorded in the AND-1B core to individual 40-kyr ice-volume cycles 
in the deep-sea benthic 5'%O stack”* and modelled ice-volume cycles 
between 4.896 and 4.631 Myr ago. Figure 4 illustrates an interval in 
which the chronology constrains 16 successive early- to mid-Pliocene 
(3.60-2.87 Myr ago) cycles (32a—d to 20), spanning 700 kyr. In this 
case, the cyclostratigraphic interpretation and the distribution of 
time within the AND-1B core are not so straightforward, primarily 
because a ~60-m-thick interval of marine diatomite, spanning 
~200 kyr, occurs between 438 and 376 m.b.s.f. We use the distri- 
bution of >2-mm-diameter iceberg-rafted debris (IBRD) through 
the continuous, thick diatomite unit as an index for ice-rafting 
intensity and glacial variability. The IBRD record identifies four or 
five main orbital cycles (32a—d) reflecting glacial fluctuations during 
an extended period of biopelagic deposition in the Ross embayment, 
when the ice sheet remained landward of the drill site (Fig. 2). The 
composition of the IBRD reflects oscillations of local outlet glaciers, 
which remained near the coast with no significant expansion into the 
Ross Sea”’. 

If the four or five IBRD cycles in the diatomite and the eleven 
overlying unconformity-bounded cycles are distributed evenly over 
this time interval, the resulting duration is ~40 kyr per cycle (Fig. 4). 
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The short-duration normal-polarity interval between the Mammoth 
and Kaena subchrons is not represented in the AND-1B core and could 
be missing at any of the GSEs between the bases of cycles 28-22. 
However, if cycles 28—22 are distributed evenly across the amalgamated 
reversed-polarity subchrons between 3.30 and 3.03 Myr ago, then a 
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one-to-one match with ~40-kyr 8/°O cycles results and suggests that 
an intervening normal-polarity interval is most likely missing at the 
basal GSE of cycle 26. 

The 60-m-thick diatomite unit lacks a sea-ice-associated diatom 
flora’’, and sedimentological evidence implies that warmer-than-present 


= 
4 a b c d e 
a ' 
€ 3 Lithologic log Glacial Duration Cycle — Chronostratigraphic Benthic 5'°O Modelled Antarctic Obliquity Mean annual Summer ATS, 
=s = 2 AND-1B proximity (no: no. datum stack (%o) ice volume (m) () Insolavion, 235 insolation, 2004 
= = Ss o cycles) ° (W m*) 75°S (Wm?) (Myr) 
OSs eo 9 ! P DM 5.0 4.5 4.0 3.5 3.0 -24-16 -8 0 8 383940414243.0 FE SP LP PS 
2E&s & S| Mua |sandhoran a eee fc a a Lbiiiit Fe Ee eee ed ee ae Be 
= araity cores ~ 0 
Neeeeee f 
oS 800 kyr? r= 
£ (8 cycles?) Oo 
= Bre 
E oOo. 
af 
0.5 ES 
x 
fa) 
oO 
= 
5 | 
13} 
2 aera RFE (2 cycles) t-. is 
o = 
a 
mio. 
Jaramillo 
y! = 
(2 cycles) FF/1.67+0.03 
4 1 Lb e/51.945 a 
=H/52.01 E 
Sb 41.5 E€ 
cee — cee 5: 6 
2 
© cs 
= So 
eon Da eee 82 
40 kyr?! NKied.79 . a2 
(1 cycle?) 20 1 7/<3'87 Olduvai $e 
eee) Hn oO 
-M/>2.87 ~ 2 
+ 2.0 Og 
— 
Loo g E 
& a 
Ee eee Tn escapee anemones > 
5 
co 
= 
F2.5 
¥ 
rT 7 7Y 
no 
ao 
S| 
a hee & 
TS eeennccaal 12) 
400 kyr 28 
(9/10 cycles) 
S 
a 
2 € 
é : 
oO i 
c a Do 
g tho =| EES aw, | 8 8&;| | 8 Peo ® 
3 % Qs 
2 E i ca 
a a f SE 
o go 
3 ° 3 
me On 
No. per 10 cm + 4.0 2 
soon bn 
earn | Oo 
2 
| Cochiti 
3 
(0) 
45 
Nunivak 
519.40 ...J 265 kyr ERE ener ee 
(6 cycles) 4-779 Myr ago~ 2 
36 oO); 
| Sidufjall 
Gilb. 
E- 
38 
BESS MYiees 120'-80 "-40 ' 6 "4050 40 35 30 25 20 8 
bce Z/4.896 M ae 
596.35 eal Global sea level Ice volume (10° km’) = 
6004 (metres below present) S 
f ET P 9 ET P h er p ' eT Pp J ET P 
> 0.08 0.4 0.8 0.8 0.2 
2 0.06 0.3 0.6 0.6 0.15 
5 0.04 0.2 0.4 0.4 o4 
3 0.02 0.4 0.2 0.2 0.05 
a 


0.02 0:04 0.06 0 0.02 0:04 0:06 0 002 004 006 4 0.02 0:04 0.06 


0 0.02 0104 0.06 0 
1/f (kyr) 1/f (kyr) 1/f (kyr) 1/f (kyr) 4/F (kyr) 
ae Siltstone FA volcanic --- West Antarctica and Greenland melted Glacial surface of erosion (<1 cycle missing) 
ee Present-day ice volume ~~—— 21 40-kyr cycle eroded at unconformity 
Mudstone/sandstone = Diamictite --- Fully glaciated Antarctica (LGM eqv.) Unconformities of known duration 
--- Fully glaciated Antarctica + Northern Cycles of known duration 
Hemisphere (LGM eqv.) Cycles and unconformities of uncertain duration 


324 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 458|19 March 2009 


LETTERS 


ie a b c d e 
£5 ‘ i ‘ ‘ sat } Modelled anes Mean annual Summer energy, ATS, 
A 18; 
8 g penal Facies Glacial proximity and Cycle Chronostratigraphic Benthic 8180 Antarctic ice Obliquity insolation, 75°S 75° S (GJ m™~) 2004 
eg - code environments no. datum stack (%o) volume (m) (wm) (c = 300 Wm) (Myr) 
[Mud _| Sand |Grav.] 1 PD M 323.0280 2 4 6 8 s8a9404ta2as Oo SP O39 41 43 45 
a a iy i ; ; 
Open marine U/<4.78 
V/4.631 


Grounding zone 


Subglacial 


Grounding zone 
ubglacial 


Open marine 


Grounding zone 
(outwash) 


265 kyr 
(6 cycles) 520 


Prox. open marine 
Grounding zone 
Grounding zone 


SOQ OE [EOP NP 


1 Subglacial 

9 Grounding zone 
4 (outwash) 
3 

2 

1 

10,9 

4 

10 

2,3 Prox. open 
1 marine 


Open marine 


Top of ~150-m-thick submarine volcanic interval 


ES Diatomite B Siltstone BA Volcanic 


Ey Mudstone/sandstone [i Diamictite 


Figure 3 | Detailed analysis of early-Pliocene sedimentary cycles in the 
AND-1B core showing lithofacies interpretations of glacimarine 
environments. The glacial proximity curve tracks the relative position of the 
grounding line through ice-contact (I), ice-proximal (P), ice-distal (D) and 
marine (M) environments and provides a proxy for ice-sheet extent. Cycle 
duration is constrained by chronostratigraphic datums coded A—Z. The 
chronostratigraphy allows a one-to-one correlation of the cycles with 


oceanic and atmospheric conditions existed between ~3.6 and 3.4 Myr 
ago. Foraminiferal 5'°O values during this time are up to 0.4%o lower 
than during the Holocene epoch, with amplitudes of 0.15 to 0.35%0 
equivalent to eustatic sea-level fluctuations (including Northern 
Hemisphere ice; see Methods) of +10-30m. Modelled Antarctic ice 
volume reduces to 20 X 10°km*, or +7m in equivalent sea level 
(Fig. 2), and involves the complete deglaciation of WAIS together with 
a loss of up to +3m in equivalent sea level (5%) from the marine 
margins of the EAIS’. If complete deglaciation of Greenland (+5 m) 
also occurred during this time, then the Antarctic ice-sheet history from 
isotopes, proximal data and numerical models is in good agreement with 
the early-Pliocene eustatic sea-level fluctuation amplitudes of + 10-30 m 


Y/4.86 gta 


3 Sandstone/conglomerate 


Se eee eee 


Q 
BSS 
Sidufjall 


~—_— Glacial surface of erosion (<1 cycle missing) 
ses >1 40-kyr cycle eroded at unconformity 


Unconformities of known duration 
Cycles of known duration 
Cycles and unconformities of uncertain duration 


obliquity-paced time series of the benthic 5'°O record”® (a), modelled 
Antarctic ice volume’ (expressed in metres of equivalent sea level, 

b), obliquity (c), mean annual insolation at 75° S (d), and summer energy at 
75° S for inferred melt threshold (t) of 300 W m ? (e; see Supplementary 
Information for further explanation of summer energy). Dashed vertical 
lines represent present-day ice volume/sea level. (See Methods and 
Supplementary Table 2 for explanation of facies codes.) 


reconstructed from far-field sites such as the uplifted shallow-marine 
Wanganui basin (New Zealand)". 

The WAIS grounding line re-advanced across the western Ross Sea 
following deposition of the thick diatomite interval, as shown by the 
occurrence of progressively more ice-proximal facies in the core 
between sedimentary cycles 31 and 29 (Fig. 4). The interval is correlated 
with a 0.7%o increase in 8'°O between marine isotope stages Mg5 and 
M2, and a corresponding increase in modelled Antarctic ice volume 
equivalent to a eustatic fall from 7 m above present-day sea level to 1 m 
below’. The M2 glacial appears to terminate early-Pliocene warm 
conditions in the Ross embayment. Our glacial proximity curve shows 
that WAIS expansion across the drill site occurred during the early 


Figure 2 | Stratigraphic and chronologic summary of the upper 600 m of 
the AND-1B core showing 38 sedimentary cycles of ice-sheet advance, 
retreat and re-advance during the last 5 Myr. Lithologies (rock types) are 
plotted against depth. The glacial proximity curve tracks the relative position 
of the grounding line through ice-contact (I), ice-proximal (P), ice-distal (D) 
and marine (M) environments and provides a proxy for ice-sheet extent. 
Cycle duration is constrained by chronostratigraphic datums coded A—Z 
(Supplementary Table 1) and is explained in more detail in Methods. The 
chronostratigraphy allows correlation of some cycles (blue shading) with 
time series of the stacked benthic 8'°O record”* (5'°O = (180/ *©O) ample! 
(30/ 160) sow — 1; SMOW, standard mean ocean water; a), model-output 
Antarctic ice volume’ (expressed in metres of equivalent sea level, 


b), obliquity (c), mean annual insolation at 75° S (d) and summer insolation 
intensity at 75° S (e). Spectral estimates show that the majority of the 
variance is at ~40-kyr frequency for the 5'°O record (f), obliquity (h) and 
mean annual insolation (i). Summer insolation at 75° S has equal power in 
the obliquity and precession bands (j). The relative increase in 100-kyr 
power in model-output Antarctic ice volume (g) reflects the increased 
influence of large Northern Hemisphere ice sheets on ocean temperature 
and glacioeustatic fluctuations in the late Pleistocene. In f-j: E, eccentricity 
or ~100-kyr cycle of unknown origin; T, tilt (obliquity); P, precession; f, 
frequency; vertical axes, power as an arbitrary scale of relative spectral 
power. m.b.s.f., metres below sea floor; LGM, Last Glacial Maximum. Thin 
diamictites (<3 m) are represented by a horizontal line with dot at end. 
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Figure 4 | Detailed analysis of late early-Pliocene sedimentary cycles in 
the AND-1B core showing lithofacies interpretations of glacimarine 
environments. Glacial proximity curve tracks the relative position of the 
grounding line through ice-contact (I), ice-proximal (P), ice-distal (D) and 
marine (M) environments and provides a proxy for ice-sheet extent. Cycle 
duration is constrained by chronostratigraphic datums coded A—Z. The 


Pliocene when Holocene 8'%O values and present-day Antarctic ice 
volumes (27 X 10° km*) were exceeded. An up-core transition between 
350 and 150 m.b.s.f. is evident, marked by successively fewer submarine 
outwash deposits and thinner grounding-zone facies successions 
representing cooling of the ice sheet implied by progressively lower 
volumes of melt water (Supplementary Information and ref. 21). This 
culminates with a major cooling step that occurs across the 
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eee >1 40-kyr cycle eroded at unconformity 


Cycles of known duration 
Cycles and unconformities of uncertain duration 


chronostratigraphy allows recognition of 15 cycles, within a ~600-kyr 
period, that are correlated with obliquity-paced time series of the benthic 
3180 record”* (a), modelled Antarctic ice volume’ (b), obliquity (¢), mean 
annual insolation at 75° S (d) and summer energy at 75° S for t = 300 Wm > 
(e). Dashed vertical lines represent present-day ice volume/sea level. 
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Gauss—Matuyama polarity transition (2.6 Myr ago) associated with 
significant erosion and loss of the late-Pliocene stratigraphic record 
(Fig. 2). This step is taken to represent a major expansion of the ice sheet. 

The ~40-kyr-year variability in the size of the WAIS suggests 
differences from Milankovitch’s hypothesis that summer half-year 
insolation intensity, with its sensitivity to precession (Fig. 2e), should 
control the growth and decay of ice sheets. Historically, the lack of 
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precession in the geological record has been attributed to the 
importance of annual insolation that is controlled by obliquity, with 
more influence on polar temperatures than seasonal insolation 
modulated by precession (see, for example, ref. 28). Figure 2 high- 
lights the relationship between obliquity (Fig. 2c), mean annual 
insolation (Fig. 2d) and the AND-1B cycles for the Pliocene, and 
Figs 3 and 4 show details for the early Pliocene. Given the sensitivity 
of WAIS mass balance to ocean temperature’, we suggest that 40-kyr 
orbital cycles may regulate southward export and upwelling of 
Circumpolar Deep Water with consequences for melt rates at 
grounding lines of Antarctic ice sheets. Several recent studies have 
linked changes in Atlantic meridional overturning” and Antarctic 
circumpolar ocean circulation® to obliquity forcing. An interglacial 
mechanism has been proposed whereby the southward expansion of 
westerly winds and associated northward Ekman transport is com- 
pensated for by enhanced upwelling of warmer, CO,-rich 
Circumpolar Deep Water*®, which also promotes atmospheric 
warming. Such a positive feedback is supported by the strong 
correlation between temperature and CO, in Antarctic ice-core 
records and may accelerate sea-ice loss with attendant changes in 
albedo, further increasing oceanic and atmospheric warming. 

The low abundance of sea-ice-associated diatoms (<5%) in the 
early-Pliocene diatomite intervals of the AND-1B cycles suggests that 
sea surface and air temperatures may have been above freezing for a 
significant part of the austral summer. On long timescales, insolation 
integrated over the length of summer (summer energy) has been 
shown in models to control the surface melting of ice sheets at the 
obliquity period® (Supplementary Information), provided that the 
ablating margin is at high latitude and that the surface temperature 
remains above 0 °C for a significant part of the season”. Although the 
latter condition is not currently met by the Antarctic ice sheet, its 
surface melt threshold may have been exceeded during the early 
Pliocene and may be exceeded again in the next 100 years. Given 
the geological evidence from the AND-1B core, we suggest that the 
mass balance of the early-Pliocene Antarctic ice sheet, although 
primarily controlled by ocean-induced melting’, may also have been 
influenced by surface ablation (Figs 3 and 4). A recent model of the 
early-Pliocene terrestrial EAIS implies sensitivity to surface temper- 
ature on its low-elevation margins at an atmospheric CO; level of 
400 p.p.m. (ref. 32). Furthermore, documented in-phase insolation- 
linked warming during an early-Pleistocene interglacial*’ suggests 
that significant melt may occur during precession-amplified 
obliquity cycles (Supplementary Information). More well-dated 
sediment cores, and future experiments using ice-sheet models 
(see, for example, ref. 7), are planned to assess more fully the different 
influences of orbital forcing on surface ablation versus ocean- 
induced melt at elevated atmospheric CO; levels. 

The unconformity-bounded glacimarine cycles in the AND-1B 
core provide ice-proximal evidence for ~40-kyr oscillations in 
Pliocene WAIS and in some of the EAIS outlet glaciers draining into 
the Ross embayment south of the drill site, a type of behaviour similar 
to that of the unstable Northern Hemisphere ice sheets of the past 
~3 Myr. 


METHODS SUMMARY 


The 98%-complete AND-1B drill core was recovered from ~850 m of water, 
from an 85-m-thick portion of the Ross ice shelf, west of Ross Island (Fig. 1 and 
Supplementary Fig. 1). Drilling used a custom-built riser system embedded into 
the sea floor, enabling continuous wireline diamond-bit coring to a depth of 
1284.87 m.b.s.f. The core was initially processed at the drill site and transported 
to McMurdo Station on Ross Island where it was split, logged, sampled and 
analysed. An initial science report provides a summary of drilling, curation and 
science methods and the initial results that underpin many of the interpretations 
in this paper’'. 

The 38 glacimarine sedimentary cycles are recognized on the basis of the repeti- 
tive vertical occurrence of characteristic facies within the AND-1B core. These 
facies and their interpretations are summarized in Supplementary Table 2, and 
shown for a single cycle in Supplementary Fig. 3. The age model for the AND-1B 
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core was developed from quantitative diatom biostratigraphic and *°Ar/??Ar 
radiometric ages to constrain the correlation between the magnetic polarity stra- 
tigraphy and the geomagnetic polarity timescale. Chronostratigraphic data are 
summarized in Supplementary Table 1. The biostratigraphic and radiometric 
datums alone are not used for absolute age control or correlation of cycles with 
the 8'80 record, because their numeric ages have uncertainties of up to +50 kyr. 
However, their precision is sufficient to match polarity zones, identified in the 
core, with the geomagnetic polarity timescale. These higher-precision (+5 kyr) 
geomagnetic polarity transitions provide the ‘time spikes’ for cycle correlation 
and estimation of cycle duration. For example, the six early-Pliocene glacial cycles 
between 4.896 and 4.493 Myr ago (Fig. 3) are correlated with C3n.3n/C3n.2r (mid 
Gilbert, 4.896-4.631 Myr ago) spanning 265 kyr (Fig. 3). Linear interpolation 
between the dated polarity transitions implies average durations of 39 kyr and 
48 kyr per cycle for the lower three (polarity zone N6) and upper three (polarity 
zone R5) cycles, respectively. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Facies analysis of sedimentary cycles and the interpretation of glacial proximity. 
The individual glacimarine cycles are interpreted to represent ice advance—retreat— 
re-advance of the grounding line of a laterally continuous ice sheet in Ross embay- 
ment, and (from bottom to top) include the following stratigraphic elements. 

(1) Erosion and subglacial deposition by an advancing ice sheet producing a 
GSE that is sharp, planar to sub-horizontal and truncates underlying deposits. 
Syndepositional soft-sediment deformation structures, intraclasts, clastic dykes 
and shearing of lithologies occur above and below the GSE. Subglacial features 
are also indicated by clast orientation and striation within tills. 

(2) An interval of grounding-zone deposition comprising conglomerates and 
poorly sorted stratified sandstones together with stratified diamictite represent a 
range of basal meltwater conditions observed from subpolar to polar environ- 
ments”, and are associated with glacimarine processes including subglacial melt- 
water outwash, proglacial debris-flow deposition, and iceberg rainout. These 
pass upwards into finer-grained terrigenous sediments representing more distal 
deposition from turbid plumes and grounding-line fans as the grounding line 
continues to retreat*”*. In some cases, ice-shelf environments are indicated by 
non-fossiliferous, unbioturbated intervals of interstratified sands and silts occur- 
ring stratigraphically between ice-proximal and open-marine facies”. 

(3) Open-marine hemipelagic and pelagic sedimentation with and without 
iceberg influence. The distal deposits with least glacial influence are diatomites 
representing biogenic sediment accumulation in an ice-free, highly productive 
ocean setting. 

(4) A proglacial facies succession of progressively more proximal grounding- 
zone deposits (as in (2)), represents ice-sheet re-advance and eventual glacial 
overriding at the drill site. This may or may not be preserved below the GSE, 
depending on the degree of erosion on the GSE. 

Several facies commonly associated with proximal glacimarine deposition are 
noted in AND-1B (Supplementary Table 2), including mudstone with dispersed 
clasts, conglomerates, sorted sandstone (with a TAM provenance), conglomerates 
and rhythmically interlaminated couplets of claystone with either siltstone or very 
fine-grained sandstone (see, for example, Supplementary Fig. 3). The changing 
abundance of these facies throughout the core, and comparison with modern-day 
analogues, provides insight into the past extent of subglacial meltwater processes 
following the concepts established for the Cape Roberts Project core**. For 
example, the rhythmically interlaminated claystone with silt/sandstone facies in 
AND-1B are consistent with previously described cyclopsam and cyclopel facies 
from modern temperate to subpolar glacimarine environments in Alaska and the 
Greenland margin, where they are deposited in quiet-water basins by suspension 
settling from meltwater plumes****. These sediments are unknown in modern- 
day Antarctic glacimarine settings, yet are common in temperate to subpolar 
settings”. 

Chronostratigraphic constraints. A preliminary age model for the upper 600 m 
of the drill core constructed from diatom biostratigraphy, and radiometric ages on 
volcanic material, allows a unique correlation of ~36% of the magnetic polarity 
stratigraphy with the geomagnetic polarity timescale. Since publication of this age 
model’', the chronology has been improved by further diatom biostratigraphic 
analysis of the diatomite interval between 460 and 383 m.b.s.f. We now identify a 
significant hiatus of ~800 kyr at the base of a volcaniclastic gravity-flow deposit at 
~440.12 m.b.s.f. Additionally, a new “°Ar/*’Ar potassium feldspar age of 
4.800 + 0.076 Myr on a primary volcanic deposit at 481.80 m.b.s.f. (Supple- 
mentary Information) confirms the correlation of our magnetic polarity stra- 
tigraphy with the geomagnetic polarity timescale for this early-Pliocene interval 
of the core. All chronostratigraphic datums used to constrain estimates of cycle 
duration, and correlations with proxy climate and climate-forcing time series in 
Fig. 2 and Supplementary Fig. 4 for the upper 600m of the AND-1B core, are 
summarized in Supplementary Table 1. 

Early-Pliocene chronostratigraphy between 400 and 600m.b.s.f. Biostra- 
tigraphic event ages include the maximum and minimum ages identified in the 
CONOP constrained Southern Ocean diatom biochronologic calibration of 
ref. 40. The ages for new events described below are determined from recent model 
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runs*'. The first occurrence of Rhizosolenia sp. D is observed at 440.12 m.b.s-f. 
The maximum age of this event is well constrained in the latest versions of the 
CONOP models (which incorporate AND-1B data). Rhizosolenia sp. D has been 
documented in at least seven sections from around the Antarctic’. The 
occurrence of this diatom event, therefore, requires the age of the core at and 
above 440.12 m.b.s.f. to be <3.58 Myr. The palaeomagnetic zone in this portion of 
the core (440.12—346.13 m.b.s.f.) is normal and correlates C2An.3n (base of the 
Gauss chron, 3.596-3.300 Myr ago). The absence of Fragilariopsis interfrigidaria 
and Fragilariopsis barronii between 450 and 440.12 m.b.s.f. implies an age-range 
equivalent depth of 4.72-4.29 m.b.s.f. for this interval and constrains the min- 
imum duration of the unconformity below 440.12 m.b.s.f. to ~800 kyr. The first 
occurrence of Thalassiosira striata is observed at 456.00 m.b.s.f., constraining the 
maximum age of the core at this level to 4.78 Myr and identifying the short 
normal-polarity interval between 459.19 and 452.86 m.b.s.f. as the Nunivak sub- 
chron (C3n.2n, 4.631—4.493 Myr ago). The radiometric age of 4.8 + 0.076 Myron 
the pumice-rich layer at 481.8 m.b.s.f. constrains the superjacent N-R transition 
at 519.4m.b.s.f. to the top of the Sidufjall subchron (4.799 Myr ago). The first 
occurrence of the marine diatom Thalassiosira complicata at 583.64 m.b.s.f. is 
4.86 Myr ago and constrains correlation of the R—-N transition at 596.35 m.b.s.f. 
with the base of the Sidufjall subchron (4.896 Myr ago). 

Calibration of the benthic foraminiferal 5'°O curve to sea level. The stacked 
benthic foraminiferal 5'°O record for the last 5 Myr (ref. 25) was converted to a 
sea-level curve by scaling with a calibration of 0.011%o m_', which is consistent 
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Modelling West Antarctic ice sheet growth and 
collapse through the past five million years 


David Pollard’ & Robert M. DeConto” 


The West Antarctic ice sheet (WAIS), with ice volume equivalent to 
~5m of sea level’, has long been considered capable of past and 
future catastrophic collapse**. Today, the ice sheet is fringed by 
vulnerable floating ice shelves that buttress the fast flow of inland 
ice streams. Grounding lines are several hundred metres below sea 
level and the bed deepens upstream, raising the prospect of runaway 
retreat’. Projections of future WAIS behaviour have been 
hampered by limited understanding of past variations and their 
underlying forcing mechanisms®’. Its variation since the Last 
Glacial Maximum is best known, with grounding lines advancing 
to the continental-shelf edges around ~15 kyr ago before retreating 
to near-modern locations by ~3 kyr ago®. Prior collapses during 
the warmth of the early Pliocene epoch’ and some Pleistocene 
interglacials have been suggested indirectly from records of sea level 
and deep-sea-core isotopes, and by the discovery of open-ocean 
diatoms in subglacial sediments’®. Until now'', however, little direct 
evidence of such behaviour has been available. Here we use a 
combined ice sheet/ice shelf model’? capable of high-resolution 
nesting with a new treatment of grounding-line dynamics and ice- 
shelf buttressing” to simulate Antarctic ice sheet variations over the 
past five million years. Modelled WAIS variations range from full 
glacial extents with grounding lines near the continental shelf break, 
intermediate states similar to modern, and brief but dramatic 
retreats, leaving only small, isolated ice caps on West Antarctic 
islands. Transitions between glacial, intermediate and collapsed 
states are relatively rapid, taking one to several thousand years. 
Our simulation is in good agreement with a new sediment record 
(ANDRILL AND-1B) recovered from the western Ross Sea", indi- 
cating a long-term trend from more frequently collapsed to more 
glaciated states, dominant 40-kyr cyclicity in the Pliocene, and 
major retreats at marine isotope stage 31 (~1.07 Myr ago) and other 
super-interglacials. 

Large-scale modelling of the WAIS requires an ice-sheet model that 
combines the flow regimes of grounded and floating ice efficiently 
enough to allow simulations of ~10° yr or more. This is challenging, 
because the scaled equations for the two regimes are very different, and 
near the grounding line they interact in a boundary-layer zone that 
affects the large-scale dynamics’. More rigorous higher-order flow 
models without separate scalings are currently too computationally 
expensive for long-term continental applications’’. Our approach 
simply combines the scaled sheet and shelf equations’, while capturing 
grounding-line effects by imposing a new mass-flux condition’. Other 
standard model components predict variations in ice thickness, ice 
temperatures, and bedrock elevation below the ice (see Methods). 

The multi-million-year timescales considered here are beyond the 
capability of most climate models to provide the necessary time- 
continuous forcings required by the ice sheet model. Instead we 
use techniques similar to those used in previous studies®’ and drive 


the model with simple parameterizations of surface mass balance, air 
temperature and specified sea level. A new parameterization of sub- 
ice-shelf ocean melt based on modern observations'*"° accounts for 
changes in the shape of coastlines and distance from the ice edge to 
open ocean”’ (see Methods). 

Before considering long-term simulations, it is helpful to examine 
the link between equilibrated ice-sheet states and the strength of 
various forcing mechanisms (Fig. 1) representative of extreme inter- 
glacial (left of graphs), modern interglacial (middle) and full glacial 
(right) conditions. In between the values shown, each forcing is 
linearly interpolated along the x axis. This closely approximates how 
they co-vary in long-term simulations, but not exactly due to inde- 
pendent influences of 5/80 and austral insolation (see below). The 
envelopes of ocean-melt values are chosen so that complete WAIS 
collapse and full glacial expansion are just attained. 

Figure la indicates a smoothly varying response from intermediate 
to large WAIS sizes, with sharper transitions into and out of extreme 
interglacials (collapses), and also back from full glacial to intermediate 
states. This behaviour is seen in long-term simulations and anima- 
tions (Supplementary Videos 1, 2), with rapid transitions taking from 
one to several thousand years. The whole range of Antarctic states in 
the model is more or less ‘one-dimensional’, that is, the Ross, Weddell 
and Amundsen Sea sectors of the WAIS usually retreat and expand in 
unison, resulting in just one type of configuration for a given total ice 
volume. This suggests that the broad-scale Plio-Pleistocene history of 
the WAIS is represented at the ANDRILL AND-1B drill site’, and 
persistent absence of a Ross ice shelf is indeed indicative of major 
WAIS retreat. 

The relative importance of individual forcing mechanisms is shown 
in Fig. 1b. For modern to extreme interglacial conditions, changes in 
surface climate and sea level are relatively small, while changes in 
ocean melt are dominant via their effect on ice-shelf buttressing. For 
modern to glacial conditions, a combination of ocean-melt and sea- 
level changes is needed to produce realistic WAIS expansion®”’. 
Changes in precipitation and surface temperature have significant, 
but largely cancelling, effects: without reduced precipitation in cooler 
climates, glacial volumes are too large (‘no AP’, Fig. 1b); without the 
effects of cooler surface temperatures on internal ice temperatures, 
viscosities and basal sliding, glacial ice flows too easily and volumes are 
too small (‘no AT’, Fig. 1b). 

A five-million-year simulation (Fig. 2) is performed from the early 
Pliocene to present, with the long-term variation of each forcing 
mechanism parameterized largely as a function of deep-sea-core 
5/80 (ref. 18). Sea level over most of this interval is dominated by 
Northern Hemispheric ice volume, and can be readily prescribed in 
proportion to 8'°O. The responses of Antarctic surface temperature 
and precipitation to Pleistocene glacial cycles are also reasonably con- 
strained by climate studies and observations, and we adapt established 
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Figure 1 | Equilibrium West Antarctic ice volumes versus specified forcing, 
and ice-sheet configurations. Left panels, ice volumes. The four forcing 
mechanisms are sub-ice-shelf oceanic melting and departures of sea level, 
annual precipitation AP and temperature AT from present. The three sets of 
forcing values represent climates for extreme interglacial (left), modern 
interglacial (middle), and full glacial (right). In between, each forcing is 
linearly interpolated along the x-axis (w,, see equation (6) in Methods). The 
triplets of sub-ice oceanic melt rates are for protected, exposed-shelf, and 
deep-ocean regions—[M,, M., Mg], equations (7) and (8) in Methods. a, All 
forcing mechanisms changed together. Solid (dashed) curves are generated 
with ice sheets initialized from prior solutions representing cooling 
(warming) trends. The slight difference between the two curves contrasts 
with the much larger hysteresis of East Antarctica, where surface melt and 
not sub-ice-shelf melt is the dominant ablation process**. b, With one 
mechanism held constant at its modern value, and all others changed. Right 
panels, ice sheet configurations representative of the three climatic states, 
with the black dot showing the location of the ANDRILL AND-1B drill site’’. 


parameterizations using 5'8O and austral insolation as inputs (equa- 
tions (1) and (2) in Methods). 

Factors controlling past variations of oceanic sub-ice melt on ~104 yr 
timescales are less certain. Sub-ice oceanic melting is affected in part by 
circum-Antarctic deep-water (CDW) warmth and its incursions onto 
continental shelves’. We argue that CDW and sub-ice melt have been 
mainly controlled by far-field climatic influences that vary in step with 
Northern Hemispheric glacial—interglacial cycles (see Methods). 
Without identifying the explicit link (which may involve atmospheric 
CO,, meridional overturning circulation, sea level, or other global-scale 
teleconnections), we hypothesize that temporal variations of Antarctic 
sub-ice ocean melt rates are represented by records that correlate with 
Northern Hemispheric glacial variations, that is, deep-sea-core 5'°O 
(equations (6)—(8) in Methods). A minor additional influence on sub- 
ice melt from austral summer orbital insolation anomalies” is also 
needed to produce precessional cyclicity like that observed during 
marine isotope stage 31 (MIS 31) around 1 Myr ago’’*’. Our forcing 
is warmest during the early Pliocene warm period (~5 to ~3 Myr ago) 
due to light 5'*O values at that time; however, the parameterizations are 
based more on Pleistocene variations, and may not fully represent the 
warm Pliocene if unique processes (for example, persistent El Nifio)’ 
were involved. 

With long-term forcing variations mainly following deep-sea-core 
5'%0 (ref. 18), the ice-sheet model is continuously integrated over the 
past 5 Myr (Fig. 2). Except for small variations along the Wilkes 
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Figure 2 | Simulated total Antarctic ice volume over the past five million 
years. a, Stacked deep-sea-core benthic 3180 (ref. 18). b, Total Antarctic ice 
volume (red line) in a long-term simulation with variations of sub-ice melt 
and other forcings parameterized mainly from the deep-sea-core 5'°O 
record. Equivalent changes in global sea level are shown on the right, 
accounting for the fraction of grounded ice above sea level compared to that 
below sea level’. Bars along the x-axis indicate conditions at a single location 
(78.0° S, 169.4° E), shifted one grid box to the east of AND-1B"' to avoid 
poorly resolved Ross Island shorelines (yellow, open ocean; blue, floating ice 
shelf; green, grounded ice). Yellow and blue/green here correspond to the 
AND-1B diatomite (yellow) and diamictite (green) intervals in Fig. 2 of ref. 
11. ¢, As b but with the time axis expanded over the past 1.5 Myr. Grey 
shading indicates simulated super-interglacials, beginning with MIS 317’. 


margin” and in inlets such as Prydz bay”’, East Antarctica is stable 
throughout the simulation and nearly all of the ice-volume variability 
is due to West Antarctica. Several key aspects of the model time series 
agree with the AND-1B core''. There is an overall progression from 
predominantly smaller WAIS sizes to larger. Furthermore, intervals 
of WAIS collapse with little or no marine ice are much more common 
from ~5 to 3 Myr ago, which is consistent with intervals in the 
drillcore dominated by diatomaceous sediments indicating warmer 
sea surface temperatures, little or no summer sea ice, and an open 
marine Ross embayment”’. In fact, the two thickest diatomaceous 
intervals in the core, between ~4.3 and 3.4 Myr ago, correspond to 
the period with the most frequent and prolonged WAIS collapses 
simulated by the model. These collapses could well be continuous 
if additional Pliocene warm-period forcing was added”. After 3 Myr 
ago, there are longer intervals with modern-to-glacial ice volumes, 
that is, with ice-shelf or grounded-ice cover at or near the AND-1B 
site (Fig. 2), again in rough agreement with the increasing predomi- 
nance of diamictite after 3 Myr ago indicating overriding ice or a 
proximal grounding zone’. 

Brief WAIS super-interglacial collapses occur after 3 Myr ago but 
with decreased frequency. In some cases, these precisely match the 
thinner diatomaceous intervals in the AND-1B core, including the 
well-dated MIS 31 event at 1.07 Myr ago''”". The large 100-kyr fluc- 
tuations of the past million years are similar to those modelled in 
earlier studies®”’’. The last retreat of WAIS from ~15 kyr ago to the 
present roughly matches the observed retreat of Ross Sea grounding 
lines”, and is particularly realistic with modifications described in 
Supplementary Information section 6. 

The model predicts several major WAIS collapses during Pleistocene 
interglacials (Fig. 2c), at times when 5'°O minima coincide with strong 
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austral summer insolation anomalies. The simulated collapse at MIS 31 
corresponds well with core evidence'’”', both in terms of timing and 
magnitude, but more recent collapses (for example, ~200 kyr ago) do 
not always coincide with the late-Pleistocene interglacials (~125 kyr 
ago and ~400kyr ago) usually suspected of harbouring such 
events'’®*°’”, Thus, while the total number of collapses is reasonable, 
their sometimes imprecise timings may reflect the limitations of our 
simple forcing parameterizations, including uncertainties in the 40-kyr 
phase relationship of Antarctic sub-ice melt to deep-sea-core 5'°O 
records, and the influence of local orbital insolation forcing. 
Although the model clearly captures the overall ~40-kyr periodicity 
seen in the AND-1B record, the precise phasing between Antarctic ice 
sheet variations and Northern Hemispheric climate changes remains 
uncertain. In some instances, the timing of our simulated super- 
interglacials may be an artefact of the phasing between the imposed 
5'°O and austral summer insolation forcings. Recent observational 
and modelling studies on the relative timing of Northern 
Hemisphere ice volume variations, ocean meridional overturning 
and orbital forcing” are pertinent to this issue, but with no clear 
consensus to date. These relationships could also be explored in future 
work with global climate models in combination with regional circum- 
Antarctic and sub-ice-shelf ocean modelling'*””, to better ascertain the 
effects of Northern Hemispheric glacial cycles, orbital forcing and 
greenhouse gas concentrations on regional Antarctic conditions. 

To better focus on the Ross embayment and the AND-1B site'', we 
ran higher-resolution (10 km) nested ice sheet-shelf simulations for 
particular times, with boundary conditions at the domain edges 


1.094 Myr ago 


Figure 3 | Snapshots at particular times from the long-term simulation 

in Fig. 2. Shown are 1.094 Myr ago, 1.079 Myr ago (MIS 31 retreat) and 
modern. a—c, Grounded ice elevations and floating ice thicknesses, shown 
respectively (in m) by upper and lower colour scale on right. d—f, Surface ice 
speeds (myr '), from higher-resolution (10 km) nested runs over the Ross 


1.079 Myr ago 
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obtained from the long-term all-Antarctic simulation. Figure 3 illus- 
trates a wide range of WAIS states, from weak glacial, full WAIS collapse, 
to modern conditions. The modern network and behaviour of Siple 
Coast ice streams and Transantarctic outlet glaciers is well resolved 
(Fig. 3f), with some ice streams stagnating and re-activating over the 
several thousand years of the nested run*' (Supplementary Videos 3 and 
4). Ross ice shelf velocities are also similar to observations, as is the 
central streamline dividing Siple (West Antarctic) and Transantarcic 
(East Antarctic) ice*'. The finer ice grid resolves the general ice flow 
around Ross Island, although the details of flow are not fully resolved 
within the narrow confines of McMurdo Sound containing AND-1B 
and other drill sites'’’?. When shelf ice is present at AND-1B (Fig. 31), 
offshore flow just to the east is always northward, with ice originating 
from major Transantarctic outlet glaciers to the south (Byrd, Skelton, 
Mulock). This offshore flow pattern and its Transantarctic provenance 
prevail whenever there is shelf ice around Ross Island. 

The dominant regional control is the overall strength of sub-ice 
oceanic melting in the Ross embayment, which causes both Siple- 
and Transantarctic-sourced ice to recede or advance in concert over 
the eastern and western sides of the embayment, respectively. It is very 
rare for one type or the other to dominate. Thus, although the 
provenance of shelf ice around Ross Island may be insensitive to the 
overall WAIS state, the basic presence or absence of shelfice at the AND- 
1B site'' is a good qualitative indicator of maxima and minima in WAIS 
ice volume (Fig. 2). Other sites not yet cored in the central Ross embay- 
ment may offer even better potential for uniquely identifying times of 
WAIS collapse (Supplementary Fig. 3). These simulations show how 


Modern 


+ = 2,000 m yr 


embayment for the same three times, showing the whole nested domain. 
g-i, Floating ice thicknesses (m) and velocity vectors from the nested 
simulations, enlarged over the western Ross embayment. Vectors are shown 
only every third grid point for clarity. The location of AND-1B is shown by 
the black dot. 
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local observables in the AND-1B and other cores relate to overall WAIS 
evolution. In particular, our results imply that the presence or absence 
of grounded or floating ice in the vicinity of McMurdo Sound is indeed 
linked to WAIS ice volume, and that open-water conditions in the Ross 
Sea are indicative of partial to complete collapse of the WAIS. 

Some of our results are independent of the parameterized temporal 
variations in long-term forcing. For example, the estimated magni- 
tudes of sub-ice oceanic melt rates needed to produce full WAIS 
amplitudes (Fig. 1 and Methods) form a point of reference for future 
modelling. Another independent result is the tendency for the WAIS 
to experience relatively rapid transitions within one to a few thousand 
years, as forcing is smoothly varied. This includes transitions into and 
out of collapsed states, and from full glacial to modern-like ice extents. 
A collapse from modern conditions occurs when sub-ice ocean melt- 
ing increases from 0.1 to 2myr~' under shelf interiors, and from 5 to 
10myr_ ' near exposed shelf edges (M, and M. respectively, in equa- 
tions (3), (7) and (8) in Methods). Recent melt rates under small 
Antarctic ice shelves are inferred to be increasing dramatically'’>"*. 
The relationship between sub-ice melt rates and ocean temperatures 
is just beginning to be explored’’, but those data’*'® and simplified 
modelling* suggest relationships on the order of 10myr' °C! for 
smaller shelves, and 0.4m yr '°C~' for whole-shelf averages under 
the major Ross and Filchner-Ronne shelves. Dividing our interior- 
melt (M,) increase of 1.9myr ' by the latter sensitivity of 
0.4myr_ '°C~! suggests that the WAIS will begin to collapse when 
nearby ocean temperatures warm by roughly 5 °C. Global climate and 
regional ocean modelling is needed to predict when and if future 
ocean temperatures and melt rates under the major Antarctic ice 
shelves will increase by these amounts, and if so, for how long. 


METHODS SUMMARY 


The scaled dynamical equations for sheet flow (shallow ice approximation) and 
shelf flow can be combined heuristically'*. However, for efficiency in these long- 
term simulations, they are applied separately depending on whether ice is 
grounded or floating. Despite this simplification and coarse grids, the effects 
of the grounding-line boundary layer are captured by imposing a mass-flux 
condition across the grounding line following ref. 5, which sets ice velocities 
there as a function of ice thickness. To include important effects of ice-shelf 
buttressing, the imposed grounding-line velocities are reduced depending on the 
ratio of longitudinal stress to its free-floating value’ (see Supplementary 
Information). The model also contains three other standard components: (1) 
an ice-mass advection equation predicting ice thickness and accounting for 
surface accumulation minus ablation and basal melt, (2) an ice temperature 
equation including horizontal advection, vertical diffusion and shear heating, 
and (3) a bedrock elevation equation with local relaxation towards isostatic 
equilibrium and elastic lithospheric flexure®’. There is no explicit basal hydro- 
logy, other than allowing basal sliding only where the bed is at the melt point. 
Equilibrium ice-free topography and bathymetry are prescribed from the 
modern BEDMAP database’, by removing all ice and allowing the bed to 
rebound isostatically. Prescribed basal sliding coefficients crudely represent 
the likely spatial distribution of deformable sediment versus hard bedrock, that 
is, sediment where the ice-free rebounded topography is below sea level (mostly 
WAIS) and bedrock where above (mostly East Antarctic ice sheet). In addition, 
intermediate basal stiffness is prescribed in the Pine Island/Thwaites drainage 
sector and Transantarctic inlets below sea level, to improve modern grounding- 
line locations and glacier velocities there. Past surface mass balance and sub-ice- 
shelf oceanic melting are parameterized using deep-sea-core 5'°O and orbital 
insolation variations (see Methods). The model is run on a polar stereographic 
grid, with 40 km resolution for continental and 10 km for nested experiments. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Modern climatic forcing: temperature and precipitation. Modern forcing 
fields of annual surface mass-balance and temperature are specified using simple 
empirical parameterizations, and then varied in the past depending on ice-core 
or deep-sea-core time series, similarly to previous studies®*’. Annual surface 
temperatures (°C) are®? 


T = Tm + 34.46 — 0.00914 h, — 0.68775|f| + 0.1 Aq, + 10As/125 (1) 


where T,, =0°C, h, is elevation (m), |@| is latitude (°S), Aq, is annual orbital 
insolation anomaly from present at 80° S (W m_*), and Asis sea-level departure 
from present (m) representing atmospheric CO, (see equation (6) below). 
Annual precipitation P (myr_') is parameterized via temperature™: 


P = 15 x 2f-Tmj/l0 (2) 


The fraction of precipitation falling as snow, and annual surface melt if any, 

are computed from T using a positive-degree-day (PDD)* method with coef- 
ficient 0.005 m per degree-day. A sinusoidal seasonal temperature cycle of ampli- 
tude 0.1Aq, (°C) is assumed, where Aq, is January-minus-July 80° S insolation 
(Wm ”). (Very little surface-melt occurs in our simulations, because summer 
air temperatures remain below freezing everywhere.) 
Modern climatic forcing: sub-ice-shelf oceanic melt. A new parameterization 
of oceanic melt rates is used, based on the degree of protection by islands and 
bays, and distance to ice-shelf edge’’. Although simple, it captures basic features 
of other studies, such as rapid melting near edges'*’”***’, and yields reasonable 
modern shelf distributions. Modern sub-ice melt M (m yr') is 


M (1—za) [(—ze) Mp + Ze Me] + 2aMa (3) 


where the “deep-ocean’ weighting is 
Zi = max{[0, min[1, (4 —1400)/200 | | (4) 
and the ‘exposed-shelf weighting is 


Z = max{0, min[1, (A—80)/30]] e~?/!° (5) 


Here max[¥, y] indicates the greater of x and y, and min[x, y] indicates the lesser. 
The 3 modern oceanic melt rates M,,, M. and M,in equation (3) are for protected, 
exposed-shelf and deep-ocean areas, respectively, given by M, = 0.1 myr |, 
M.=5myr_', and My=5myr '. In equations (4) and (5), h, is bathymetry 
(m), A is the angle (degrees) subtended by the set of all straight lines from the 
point in question that reach open ocean without encountering land or grounded 
ice, and D (km) is the sub-ice distance to the closest open-ocean point. The angle 
Ais the main way we achieve realistic modern ice-shelf edges. Around most major 
Antarctic shelf-edges today, A is ~90° to 100°; whether this coincidence has a 
physical basis requires exploration with regional ocean models. 
Past climatic forcing: sea level, temperature and precipitation. We need to 
prescribe long-term variations of sub-ice oceanic melt rates, sea level, air tem- 
perature and precipitation over the past 5 Myr. On longer timescales, atmo- 
spheric CO, levels outside the Plio-Pleistocene range (~180-380 p.p.m.v.), 
basal sediment changes, and tectonic uplift or subsidence are probably import- 
ant, but were probably minor through the Plio-Pleistocene. As mentioned above, 
our Pleistocene-centric parameterizations may underestimate warmth during 
the early Pliocene ~5—3 Myr ago when CO, levels rose to ~380 p.p.m.v. (ref. 9). 
Sea-level variations have been dominated by Northern Hemispheric ice 
volume, and are assumed proportional to deep-sea core 5'%O and calibrated as 
in equation (6). Past variations of Antarctic annual surface temperatures are 
included in equation (1), proportional to a combination of atmospheric CO 
(which is represented by 5'°O via sea level in equation (1), since all three are 
highly correlated in the Pleistocene at least) and the annual 80° S insolation 
anomaly. Past variations in precipitation depend on air temperature, just as 
for modern spatial variations (equation (2)). 
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Past climatic forcing: sub-ice-shelf oceanic melt. The long-term controls of 
sub-ice-shelf melting are just beginning to be explored'*!*”’. Here we propose a 
parameterization based on simple reasoning and sensitivity tests of WAIS retreat 
since 15 kyr ago. This last deglacial retreat is the only well-documented WAIS 
variation on 10*-year time scales. It cannot have been driven by surface mass 
balance, because Antarctic precipitation has increased, not decreased, and there 
has been negligible surface melt during this time. Model sensitivity tests show that 
sea-level rise alone, and/or the influence of warming temperatures on ice viscosity 
and basal sliding, account for only a small fraction of the observed retreat. 
Therefore, increases in sub-ice melting must have been key. They could reasonably 
have been driven either by regional Southern Hemispheric orbital insolation 
changes, or by global-scale far-field influences. Southern Hemispheric insolation 
is unlikely to have been the dominant driver, because (1) the summertime 80° S 
anomaly from present was small and negative between 15 and 2 kyr ago, and (2) 
the annual 80° S anomaly, with minimum at 28.7 kyr ago and maximum at 9.5 kyr 
ago (ref. 20), would have caused retreat to commence too early (before ~19 kyr 
ago) judging from Ross Sea grounding-line history (~10 kyr ago)**”*. This is 
borne out by sensitivity tests (Supplementary Fig. 5) in which austral insolation 
is used as the sole driver of sub-ice melt, and results over the past 15,000 years are 
unreasonable. Realistic retreat is obtained only if sub-ice melt varies in step with 
far-field forcing. 

This suggests that sub-ice melt has been controlled not by local forcing or 
austral insolation, but by far-field climatic influences that vary in step with 
Northern Hemispheric glacial—interglacial cycles at least since ~2.5 Myr ago. 
The latter is represented here by a stacked deep-sea-core 5'°O record spanning 
the past 5 Myr (ref. 18). A small influence of austral summer insolation” is added 
to produce minor observed 20-kyr cyclicity during warm events such as MIS 317". 
First, a weighting index w, is defined by 


We = max[0, min{ 2, 1+ As/85 + max (0, Aq/40) | | (6) 


where 5'%O is represented by As, the sea-level departure from present (m, scaled 
to 8'%O with last-glacial-maximum 125m lower than present), and Ag; is the 
January 80° S insolation anomaly from present (W m *). Sub-ice-melt rates for 
protected, exposed-shelf and deep-sea areas ([M,,M.,M,] respectively, in 
myr') are specified as [0,0,2] for maximum-glacial conditions, [0.1,5,5] for 
modern, and [2,10,10] for extreme-interglacial conditions. Then the triplet used 
in equation (3) to determine M for any past time is 


[Mp.Me,Ma] = (1— wg) (0, 0, 2] + wg [0.1,5,5] if 0<we<1 (7) 


or 


[Mp.Me,Ma] = (2—wg) (0.1, 5, 5] + (wg—1) (2, 10,10] if l1<we<2 (8) 


The modern triplet values are chosen to yield reasonable results for today’s Ross 
and Filchner-Ronne ice shelves. The glacial and warm triplets and the form of wg 
in equation (6) are chosen so that the model just attains full-glacial WAIS extents 
and complete interglacial collapses in long-term simulations and in Fig. 1. These 
values cannot be changed by large amounts without substantial degradation of 
our results. 
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An Early Cretaceous heterodontosaurid dinosaur with 
filamentous integumentary structures 


Xiao-Ting Zheng’, Hai-Lu You’, Xing Xu° & Zhi-Ming Dong” 


Ornithischia is one of the two major groups of dinosaurs, with hetero- 
dontosauridae as one of its major clades. Heterodontosauridae 
is characterized by small, gracile bodies and a problematic phylo- 
genetic position’”. Recent phylogenetic work indicates that it repre- 
sents the most basal group of all well-known ornithischians’. Previous 
heterodontosaurid records are mainly from the Early Jurassic period 
(205-190 million years ago) of Africa’’. Here we report a new hetero- 
dontosaurid, Tianyulong confuciusi gen. et sp. nov., from the Early 
Cretaceous period (144-99 million years ago) of western Liaoning 
Province, China. Tianyulong extends the geographical distribution 
of heterodontosaurids to Asia and confirms the clade’s previously 
questionable temporal range extension into the Early Cretaceous 
period. More surprisingly, Tianyulong bears long, singular and 
unbranched filamentous integumentary (outer skin) structures. 
This represents the first confirmed report, to our knowledge, of fila- 
mentous integumentary structures in an ornithischian dinosaur. 


Dinosauria Owen, 1842 
Ornithischia Seeley, 1887 
Heterodontosauridae Kuhn, 1966 
Tianyulong confuciusi gen. et sp. nov. 


Etymology. The generic name refers to Shandong Tianyu Museum of 
Nature (STMN; Pingyi, Shandong Province, China), where the spe- 
cimen is housed; the specific name is dedicated to Confucius, the 
founder of Confucianism. 

Holotype. STMN 26-3 is an incomplete skeleton preserving a partial 
skull and mandible, partial presacral vertebrae, proximal—middle 
caudal vertebrae, nearly complete right scapula, both humeri, the 
proximal end of the left ulna, partial pubes, both ischia, both femora, 
the right tibia and fibula and pes, and filamentous integumentary 
structures. 

Locality and horizon. Jianchang County, Liaoning Province, China. 
Jehol Group, Early Cretaceous*”. 

Diagnosis. A heterodontosaurid dinosaur distinguishable from other 
species in having a semi-circular diastema, a barely developed buccal 
emargination, a dorsally placed mandibular jaw joint well above the 
level of the dentary tooth row, only one premaxillary tooth (which is 
caniniform), a distinct gap between the dentary teeth and coronoid 
process, caudally increasing tooth sizes, and adjacent maxillary and 
dentary tooth crowns distinctly separate and lacking mesial and distal 
marginal denticles and ridges on the buccal surfaces. 

Description. The T. confuciusi holotype is estimated to be 70cm 
long, with a cranium that is 6cm long and tail that is 44cm long 
based on the proportions of Heterodontosaurus tucki'® (Fig. 1a, b; also 
see Supplementary Information for measurements). The holotype 
probably represents a subadult individual because the sutural lines 
between the neural arches and the centra are still visible, whereas the 
ends of the preserved long bones are all ossified. 


The partial skull, exposed in right lateral view, is missing most of its 
caudodorsal portion, including the braincase (Fig. Ic, d). A relatively long 
preorbital portion occupies about 46% of the basal skull length. The 
ventral edge of the premaxilla lies well below the maxillary tooth row, 
whereas its dorsal (nasal) process contacts the nasal. A distinct semi- 
circular diastema is demarcated on the lateral surface across the junction 
between the premaxilla and the maxilla. Caudal to the diastema, a large, 
triangular antorbital fossa occupies almost the entire caudal half of the 
lateral facial area. The long rostrodorsal (maxillary) process of the jugal 
tapers towards its rostral end and inserts between the maxilla and the 
lacrimal. The lateral mandibular articular condyle of the quadrate is 
slightly larger than the medial one. The predentary has a smooth, 
horizontal oral margin of equal length as the oral margin of the premaxilla. 
The dentary has roughly parallel dorsal and ventral margins, and its 
horizontal tooth row occupies its rostral two-thirds. A slender external 
mandibular fenestra is surrounded by the dentary, surangular and 
angular. The ventral margin of the angular tilts caudodorsally, positioning 
the jaw joint well above the level of the dentary tooth row. Only one large, 
caniniform premaxillary tooth exists and protrudes from the caudal end 
of the oral margin of the premaxilla. The maxillary tooth crowns are 
triangular in lateral view and lack both denticles along the mesial and 
distal edges and ridges on the buccal surface. Adjacent crowns are clearly 
separate from each other. Compared to the maxillary crowns, individual 
dentary teeth are relatively small and have more pointed apical ends. 

The cervical vertebrae are slightly downturned, whereas the dorsals 
are arched dorsally. Most of the 22 proximal—middle caudals are tightly 
bound by longitudinally arranged ossified epaxial and hypaxial 
tendons. The relatively long, straight humerus has a prominent 
deltopectoral crest. The ischium is straight, compressed mediolaterally, 
and does not possess tab-shaped obturator process. The postpubis is 
about half the length of the ischium. The distal end of the femur has an 
intercondylar extensor groove and an open flexor intercondylar 
groove. The four preserved metatarsals are closely appressed to each 
other; metatarsal I is reduced, and its first phalanx does not extend 
beyond the proximal end of metatarsal II. 

Three patches of filamentous integumentary structures are pre- 
served: one ventral to the cervical vertebrae (Fig. 2a), one dorsal to 
the dorsal vertebrae (Fig. 2b), and the largest dorsal to the proximal 
caudal vertebrae (Fig. 2c). The filaments in each patch largely parallel 
each other; those ventral to the body angle about 45° caudally, whereas 
those dorsal to the body lie almost perpendicular to the long axes of 
their respective parts of the vertebral column but curve gently. We 
interpret these as integumentary structures based on their general 
morphology and distribution along the fossil body. The elongation 
and hollow nature of these filaments argues against them being sub- 
dermal structures, such as collagen fibres”*. 

The tail filaments are especially long, with estimated overall lengths of 
60 mm (seven times the lengths of the underlying caudal centra) and 


"Shandong Tianyu Museum of Nature, Lianhuashan Road West, Pingyi, Shandong, 273300, China. “Institute of Geology, Chinese Academy of Geological Sciences, 26 Baiwanzhuang 
Road, Beijing 100037, China. “Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, 142 Xiwai Street, Beijing 100044, China. 
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Figure 1| New heterodontosaurid ornithischian dinosaur T. confuciusi. 

a, Main slab of the holotype (STMN 26-3). b, Broken slab. The scale bar in 
b refers also to a. ¢, Close-up of skull and mandible. d, Interpretive drawing 
of skull and mandible. e, Close-up of dentition. Abbreviations: a, angular; 
aof, antorbital fossa; ca, caudal vertebrae; cv, cervical vertebrae; d, dentary; 


widths of 0.4 mm. Although their bases are not clearly preserved, all the 
filaments seem to be singular, showing no evidence of branching. The 
filaments appear to have been relatively rigid because none are wavy or 
bent and they do not deviate along their lengths from otherwise gentle 
curvatures. The widths of all filaments appear similar in each patch, 
which strongly suggests that they have roughly constant radial dimen- 
sions and therefore circular cross sections. Although largely overlapping, 
several isolated, partial filaments on the tail also display dark, longit- 
udinal ‘stripes’, implying that each had a tubular structure (Fig. 2d). 
Given the stronger evidence for a tubular structure and their 
apparent rigidity, the filamentous integumentary structures of 
Tianyulong are more similar to those preserved on the tail of a spe- 
cimen of Psittacosaurus’ than to the ‘protofeather’ structures in both 
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dy, dorsal vertebrae; emf, external mandibular fenestra; en, external naris; f, 
femur; h, humerus; isc, ischium; j, jugal; 1, lacrimal; m, maxilla; n, nasal; pd, 
predentary; pf, prefrontal; pm, premaxilla; po, postorbital; pub, pubis; q, 
quadrate; qj, quadratojugal; scaco, scapulocoracoid; sa, surangular; tf, tibia 
and fibula. 


avian and non-avian theropods'®"’. However, the integumentary 


structures in Psittacosaurus are preserved only on the tail, and not 
distributed more widely along the body as in Tianyulong. Also, those 
in Psittacosaurus are more rigid and more widely separated from each 
other than in Tianyulong. The central tail feathers of the basal avialan 
Epidexipteryx'* (Fig. 2e), the basal pygostylian (all birds with 
shortened tails) Confuciusornis'? and some enantiornithean (one of 
the two major clades of early birds, which became extinct by the end of 
Cretaceous period) birds'*”’ are also elongate, and represent a feather 
type unknown in extent birds. Unlike the structures in Tianyulong, 
however, they are ribbon-like and branch at their distal ends’®. 
Among the ‘protofeathers’ of theropods, the integumentary growths 
of Tianyulong are more similar to those of Sinosauropteryx'® than to 
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those of Sinornithosaurus’’ (Fig. 2f) and others. In both Tianyulongand 
Sinosauropteryx, the filamentous structures are singular and 
unbranched, although the ‘protofeathers’ in the similar-sized 
Sinosauropteryx are much shorter and more slender than the caudal 
structures in Tianyulong, with lengths of less than 30 mm and widths of 
less than 0.2 mm’*. In many of the theropods, the ‘protofeathers’ are 
more or less bundled together (Fig. 2f, left); in others, they exhibit a 
clear branching pattern (Fig. 2f, right). Overall, they appear to have 
been much more pliable'®’*”*. 

Phylogenetic analysis recovers Tianyulong as a member of 
Heterodontosauridae, which, in turn, is the basalmost coherent group 
among all well-known ornithischians’ (Fig. 3; also see Supplementary 
Information). Shared derived features of heterodontosaurids include: 
ventral deflection of the oral margin of the premaxilla ventral to the 
maxillary tooth row, a large premaxilla—maxilla diastema at least one 
crown in length, quadrate with larger lateral than medial mandibular 
condyle, and a large, caudally placed caniniform premaxillary tooth; 
all these features are present in Tianyulong. Besides three heterodon- 
tosaurid genera from the Early Jurassic of southern Africa’’’, other 
reports include one genus from the Early Cretaceous of England”’, as 
well as fragmentary Middle and Late Jurassic—Early Cretaceous 
material from Europe’””® and the Late Jurassic of North America”’. 
Thus, Tianyulong expands the distribution of heterodontosaurids into 
Asia and confirms a long temporal extension for the clade into the 
Early Cretaceous, bolstering a heterodontosaurid interpretation for 
the European and North American reports. 
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Figure 2 | Integumentary 
structures of T. confuciusi holotype 
(STMN 26-3) and others for 
comparison. a, Ventral to the 
cervical vertebral series. b, Dorsal to 
the dorsal vertebral series. ¢, Dorsal 
to the proximal—middle caudal 
vertebral series. d, Close-up of 

c. Arrows in c and d point to single 
filament exhibiting a clear, dark, 
midline ‘stripe’. e, Schematic of 
long, central tail feather of 
Epidexipteryx (after ref. 12). f, Two 
types of integumentary filaments of 
Sinornithosaurus (after ref. 17). 


5mm 


The discovery of filamentous integumentary structures in 
Tianyulong provides an unprecedented phylogenetic extension of 
archosaurian dermal structures, previously reported only in derived 
theropodan saurischian dinosaurs, to Ornithischia. It also expands the 
known diversity of elaborate integumentary structures in Mesozoic 
reptiles, such as those in the diapsid Longisquama’*”’, pterosaurs**”’, 
non-avian coelurosaurian theropods'*’’ and birds'®"’, to 
ornithischians. However, the homology of these structures across taxa 
remains poorly understood. In terms of their relationships to feathers, 
both convergence**”* and homology'***”’ have been proposed to 
explain the elongate dorsal projections of Longisquama and the short, 
hair-like structures of pterosaurs; homology is favoured for the struc- 
tures of theropods*''* because they share features (such as branch- 
ing) known elsewhere only in feathers. The unique filaments of 
Tianyulong add more complexity to the issue of feather origins. 
Homology of the structures in Tianyulong and theropods is far from 
obvious with present data, but cannot be precluded. Although based 
on negative evidence, the derived position within the Theropoda 
for the known appearance of ‘protofeathers’ indicates that earlier ther- 
opods lacked integumentary structures, implying in turn that the 
common ancestor of theropods and ornithischians also lacked such 
structures and that their appearances in each clade were convergent. 

When homology with ‘protofeathers’ is hypothesized, a possible 
scenario is that the elongate, singular, cylindrical filaments of 
Tianyulong represent elaborations of the initial stage in development 
models of the origin and evolution of feathers*”’. The basal position 
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Euparkeria 
Marasuchus 
Herrerasaurus 
Pisanosaurus 
Abrictosaurus 
BMNH A100 
Echinodon 
Heterodontosaurus 
Lycorhinus 
Tianyulong 
Lesothosaurus 
Scutellosaurus 
Emausaurus 
Scelidosaurus 
Stegosauria 
Ankylosauria 
Stormbergia 
Agilisaurus 
Hexinlusaurus 
Othnielia 
Hypsilophodon 
Jeholosaurus 
Yandusaurus 
Orodromeus 
Zephyrosaurus 
Anabisetia 
Bugenasaura 
Thescelosaurus 
Talenkauen 
Gasparinisaura 
Parksosaurus 
Rhabdodontidae 
Tenontosaurus tilletti 
Tenontosaurus dossi 
Dryosauridae 
Ankylopollexia 
Stenopelix 
Wannanosaurus 
Goyocephale 
Homalocephale 
Pachycephalosauridae 
Micropachycephalosaurus 


Chaoyangsaurus 
ee Psittacosauridae 
1. Ornithischia Liaoceratops 
2. Heterodontosauridae Archaeoceratops 


Coronosauria + 


3. Genasauria Leptoceratopsidae 


Figure 3 | Phylogenetic relationships of T. confuciusi and 
Heterodontosauridae. Cladistic analysis recovers Tianyulong as a member 
of Heterodontosauridae, which is itself a basal clade of ornithischian 
dinosaurs. This is based on a strict consensus of 8 most parsimonious trees 
with a tree length of 485, consistency index of 0.501 and retention index of 
0.730. This analysis was performed by a sectorial search of TNT (tree analysis 
using new technology)*’ (see Supplementary Information). 


of Heterodontosauridae within Ornithischia then suggests that such 
early-stage structures were present in the earliest dinosaurs (before or 
at the Saurischia—Ornithischia split) and inherited by basal members 
of each group. Later ornithischians and non-theropod saurischians 
for which skin impressions are known would therefore represent 
secondary losses of these structures at as-yet undetermined systematic 
positions. The structures in Tianyulong, as well as the bristle-like 
structures of the basal ceratopsian Psittacosaurus’, which have 
frequently been perceived as nonhomologous with the filamentous 
structures of theropods, may truly be homologous, albeit derived in 
structure compared to those of theropods. 
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Population genomics of domestic and wild yeasts 


Gianni Liti'*, David M. Carter?*, Alan M. Moses”®, Jonas Warringer*, Leopold Parts”, Stephen A. James’, 
Robert P. Davey”, lan N. Roberts’, Austin Burt®, Vassiliki Koufopanou®, Isheng J. Tsai°, Casey M. Bergman’, 
Douda Bensasson’, Michael J. T. O'Kelly’, Alexander van Oudenaarden®, David B. H. Barton’, Elizabeth Bailes’, 
Alex N. Nguyen Ba’, Matthew Jones’, Michael A. Quail’, lan Goodhead’+, Sarah Sims’, Frances Smith’, 


Anders Blomberg’, Richard Durbin?* & Edward J. Louis'* 


Since the completion of the genome sequence of Saccharomyces 
cerevisiae in 1996 (refs 1, 2), there has been a large increase in 
complete genome sequences, accompanied by great advances in 
our understanding of genome evolution. Although little is known 
about the natural and life histories of yeasts in the wild, there are 
an increasing number of studies looking at ecological and geo- 
graphic distributions**, population structure’* and sexual versus 
asexual reproduction”’®. Less well understood at the whole gen- 
ome level are the evolutionary processes acting within populations 
and species that lead to adaptation to different environments, 
phenotypic differences and reproductive isolation. Here we pre- 
sent one- to fourfold or more coverage of the genome sequences of 
over seventy isolates of the baker’s yeast S. cerevisiae and its closest 
relative, Saccharomyces paradoxus. We examine variation in gene 
content, single nucleotide polymorphisms, nucleotide insertions 
and deletions, copy numbers and transposable elements. We find 
that phenotypic variation broadly correlates with global genome- 
wide phylogenetic relationships. S. paradoxus populations are well 
delineated along geographic boundaries, whereas the variation 
among worldwide S. cerevisiae isolates shows less differentiation 
and is comparable to a single S. paradoxus population. Rather than 
one or two domestication events leading to the extant baker’s 
yeasts, the population structure of S. cerevisiae consists of a few 
well-defined, geographically isolated lineages and many different 
mosaics of these lineages, supporting the idea that human influ- 
ence provided the opportunity for cross-breeding and production 
of new combinations of pre-existing variations. 

The baker’s yeast S. cerevisiae has had a long association with 
human activity’’, leading to the idea that its use in fermentation lead 
to its domestication. Two domestication events have been suggested, 
one for sake strains and one for wine”. In contrast, its closest relative, 
S. paradoxus, has never been associated with human activity and is 
found globally, sometimes in the same locations as S. cerevisiae’*. A 
preliminary comparison within the Saccharomyces sensu stricto group 
exhibited extensive variation between S. paradoxus populations on 
different continents but limited variation among S. cerevisiae isolates 
and no correlation with geographic location’®. 

Here we report nearly complete genome sequences of S. cerevisiae 
and S. paradoxus from a large variety of sources and locations 
(Supplementary Tables 1 and 2). The S. cerevisiae strains included 
the reference strain $288c plus other lab, pathogenic, baking, wine, 
food spoilage, natural fermentation, sake, probiotic and plant isolates. 


The S. paradoxus isolates were mostly from oak tree bark from the 
three recognized populations®*'’ as well as Siberia, Hawaii and the 
previously designated Saccharomyces cariocanus'*. There is overlap 
among the general geographic sources of isolates from both species. 
The majority of strains were sequenced using Sanger sequencing on 
ABI 3730 DNA sequencers (Applied Biosystems). For some strains, 
sequence was obtained using an Illumina Genome Analyzer. Most 
strains were covered to one- to fourfold depth with a few covered 
more extensively (Supplementary Table 3). The sequence reads, 
assemblies, alignments, a BLAST tool and a genome browser are all 
publicly available’>. 

We identified 235,127 high-quality single nucleotide polymorphisms 
(SNPs) and 14,051 nucleotide insertions or deletions (indels) in the 
S. cerevisiae nuclear genome, and 623,287 SNPs and 25,267 indels in 
S. paradoxus. Our S288c sequence differs from the reference genome by 
498 high-quality, unambiguous SNPs (Supplementary Fig. 1). For 480 
SNPs our S288c sequence is supported by other strains whereas the 
reference has no support, and for 18 SNPs the reference sequence is 
supported by other strains whereas ours is not. Many of the 480 SNPs 
are likely to represent errors in the reference sequence (Supplementary 
Table 4). The reference sequence for the type strain of S. paradoxus'® was 
not complete, so we sequenced the type strain CBS432 to 4.3-fold 
coverage with an ABI 3730 sequencer and to 80-fold coverage with 
the Illumina Genome Analyzer. 

Sequence surveys allow novel sequences not found in the reference 
genome to be identified. The proportions of unplaced reads for each 
strain are shown in Supplementary Table 2. We found 38 new hypo- 
thetical open reading frames (ORFs) in these sequences that are likely 
to be real. These ORFs are present in more than one strain 
(Supplementary Fig. 2), with some specific to a single lineage, such 
as the hypothetical protein 5 (Supplementary Information) in the West 
African lineage, which contains a conserved methyltransferase domain. 
Much of the unplaced material is subtelomeric. This is in contrast to a 
genome-wide analysis of copy number based on the numbers of reads 
of each strain aligning to each gene in the reference sequence, which 
showed very little significant copy number variation outside the ribo- 
somal DNA (rDNA) region (Supplementary Information). 

Neighbour-joining phylogenetic trees based on pairwise SNP 
differences in the alignments were generated (Fig. 1 and Supple- 
mentary Fig. 3). The S. paradoxus strains fall into the three previously 
described populations, plus one isolate from Hawaii. Most of the SNPs 
in S. paradoxus are private polymorphisms within each population, 
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Figure 1| Saccharomyces phylogenomics. a, Neighbour-joining trees based 
on SNP differences of S. cerevisiae and S. paradoxus strains sequenced in this 
project, using Saccharomyces mikatae, Saccharomyces kudriavzevii and 
Saccharomyces bayanus as out-groups. b, Close-up of the European 


resulting in a clear separation of the three populations” (Fig. 2a). The 
European population was sampled extensively, which provided a 
picture of within-population structure (Fig. 1b). 

The S. cerevisiae population structure is more complex. There are 
five lineages that exhibit the same phylogenetic relationship across 
their entire genomes, which we consider to be ‘clean’ non-mosaic 
lineages (Fig. 1c). These are strains from Malaysia, West Africa, sake 
and related fermentations (labelled ‘Sake’ in Fig. 1c), North America, 
and a large cluster of mixed sources containing many European and 
wine strains (“Wine/European’). The remaining strains are on long 
branches between the Wine/European cluster and the other four clean 
lineages. Although some lineages correspond to geographic origin, 
such as those from North America and Malaysia, many closely related 
strains are from widely separated locations. This mixed architecture 
could be due to human traffic in yeast strains and subsequent recom- 
bination between them. Analysis with Structure is consistent with 
separate populations for the West African, Malaysian, Sake and 
Wine/European lineages (Fig. 2b). The North American isolates share 
some polymorphisms with all four separate populations, whereas the 
rest of the strains share polymorphisms with the European lineage and 
at least one other population. Analysis of SNP distributions 
(Supplementary Table 5) is consistent with the neighbour-joining tree 
phylogeny (Fig. 1c) and the Structure analysis (Fig. 2b). Each clean 
lineage is monomorphic for the majority of segregating sites, whereas 
the mosaics are polymorphic for the majority of sites. 

Phylogenetic trees constructed for individual chromosomes or 
smaller segments (Supplementary Fig. 4) demonstrate the mosaic 
nature of these genomes, as do segmental comparisons (Supple- 
mentary Fig. 5). For example, the laboratory strains SK] and Y55 
appear to be the result of recent crosses between the West African 
lineage and the European lineage (Supplementary Fig. 5b). Similarly, 
W303 is a recent cross between the reference S288c lineage and one or 
more other lineages. Different segments of the mosaics fall into differ- 
ent locations in the neighbour-joining tree (Supplementary Fig. 4). The 
recently sequenced clinical derivative YJM789 (ref. 18) is another 
example. This complex population structure of S. cerevisiae is reported 
ina similar study’’ and is consistent with five well-delineated lineages, 
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S. paradoxus, with UK isolates highlighted in violet. ¢, S. cerevisiae strains 
with clean lineages highlighted in grey; colour indicates source (name) and 
geographic origin (dots). Scale bars indicate frequencies of base-pair 
differences. 


two of which contain isolates used in fermentation industries’’, plus a 
number of recombinant strains, many of which are also used for 
fermentation. Phenotypic profiling (see below), and analyses of 
rDNA repeat unit variation (Supplementary Fig. 6) and Ty element 
abundance (Supplementary Fig. 7 and Supplementary Table 6), pro- 
duce results consistent with this overall picture of the S. cerevisiae 
population structure. 

It is unlikely that the entire sequence space of S. cerevisiae has been 
sampled. It is clear that segments from many of the mosaic strains are 


S. paradoxus 


European Far Eastern American Hw 


S. cerevisiae 


Fraction of markers assigned to each cluster 


Malaysian Sake NA WA 


Mosaics Wine/European 


Figure 2 | Saccharomyces population structure. a, Inference of population 
structure using the program Structure (version 2.1) on S. paradoxus 
(markers: 7,544 SNPs with >30 strains passing neighbourhood quality 
standard), assuming K = 6 subpopulations and correlated allele frequencies, 
linkage model based on marker distances in base pairs, 15,000-iteration burn 
in, and 5,000 iterations of sampling. Each mark on the x axis represents one 
strain, and the blocks of colour represent the fraction of the genetic material 
in each strain assigned to each cluster. Hw, Hawaiian isolate. b, As in a, but 
for S. cerevisiae (markers: 3,413 SNPs with >30 strains passing 
neighbourhood quality standard). NA, North America; WA, West Africa. 
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not related to any of the five clean lineages and are probably derived 
from lineages that are yet to be determined or no longer exist. One- 
quarter (24%) of SNPs are found only in the mosaics (Supplementary 
Table 5), which provides a measure of the unsampled S. cerevisiae 
species space. 

Sequence variability was quantified using the average pairwise diver- 
gence within a population (0,) and the proportion of polymorphic 
sites (Og)'®. We estimated these parameters for various populations 
(Supplementary Table 7). Both 0, and 0s are about 0.001 in the UK 
population of S. paradoxus. The Wine/European cluster of S. cerevisiae 
has approximately the same level of diversity. In both the global and 
Wine/European samples of S. cerevisiae, Tajima’s D (ref. 20) is signifi- 
cantly negative, indicating an excess of singleton polymorphisms, 
which may be a consequence of our sampling strategy. By contrast, 
the UK sample of S. paradoxus from a single population has a positive 
Tajima’s D, although not significantly, indicating a relative abundance 
of mid-frequency polymorphisms. Linkage disequilibrium differs 
between samples (Fig. 3a). For S. paradoxus, linkage disequilibrium 
declines smoothly with distance, decaying to half its maximum value at 
about 9 kb, as previously reported". For both S. cerevisiae samples, the 
linkage disequilibrium decays much faster, with a half maximum at 
3 kb or less. This implies more recombination in S. cerevisiae, perhaps 
due to more opportunities for strains to mate and recombine. 

Patterns of variation can reveal evidence of natural selection. As 
expected for weakly deleterious mutations, the derived allele frequencies 
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Figure 3 | Population genomics: variation and selection. a, Linkage 
disequilibrium as a function of distance averaged over one kilobase (kb), 
expressed in terms of correlation coefficient, r’. Insets show the decline in 
linkage disequilibrium over the first 10 kb. Details are shown in 
Supplementary Table 7. b, Derived allele frequencies of SNPs in coding 
regions. Amino-acid-changing SNPs (labelled ‘a’) show an excess of low 
frequencies in comparison with synonymous SNPs (‘s’). Synonymous SNPs 
in genes with strong codon bias (‘s*’) are in excess at low and high 
frequencies. SNPs that create stop codons (‘create stop’) show skew to low 
frequencies. Inset is the number of mutations occurring over the length of 
the protein, exceeding three standard deviations (a) from the mean (1) in the 
C terminus. ¢, Distribution of sizes of indel polymorphisms in coding 
regions. High-frequency indels (>10%, red) more often occur in multiples 
of three than do low-frequency indels (grey). Inset is as for b. d, Frequency 
distribution of indels in coding regions. Out-of-frame indels (grey) show 
excess at low frequencies relative to in-frame indels (open). The proportion 
of out-of-frame indels decreases as frequency increases. Error bars represent 
the standard error of the proportion. Numbers of observations for each bin: 
0, n = 2,910; 0.1, n = 184; 0.15, n = 68; 0.2, n = 52; 0.25, n = 29; 0.3, n = 29; 
0.35, n = 36; 0.4, n = 29; 0.45, n = 40. 
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(DAF; see Supplementary Information) for non-synonymous poly- 
morphism are lower than synonymous polymorphism (Fig. 3b). For 
polymorphisms with DAF <20%, there were 0.86 amino-acid- 
changing polymorphisms for each silent one. In contrast, for those with 
DAF >20% this ratio was 0.34, indicating that at least 61% (1 — 0.34/ 
0.86) of the 24,418 amino-acid-changing polymorphisms with DAF 
<20% are deleterious. Similar calculations (Supplementary Infor- 
mation) indicate that 27% of non-coding polymorphisms with DAF 
<20% are deleterious (Supplementary Fig. 8a). We also performed 
McDonald-Kreitman tests”! on 1,105 genes for which we had enough 
statistical power. No evidence for positive selection after a multiple 
testing correction was found (Supplementary Fig. 8b). These analyses 
assumed that synonymous polymorphisms are neutral. However, we 
found an excess of polymorphism at both low and high frequency 
(Fig. 3b) in genes with high codon bias (codon adaptation index 
>0.6). Further analysis (Supplementary Information) indicates that 
codon bias in S. cerevisiae is maintained by both purifying and positive 
selection, as suggested by the mutation-selection-drift model”. 

A previous genome-wide study in Arabidopsis*’ reported a large 
number of seemingly highly deleterious alleles. We found 134 muta- 
tions that were predicted to introduce stop codons (Fig. 3b), includ- 
ing five in genes previously reported to be essential in S288c 
(Supplementary Information). These mutations showed a skewed 
frequency distribution and were enriched in the C termini (the final 
5% of proteins; Fig. 3b inset). 

This data set allowed the consideration of insertions and deletions 
(Fig. 3c). We identified 3,870 indels in the coding regions of the 
S. cerevisiae population. Of these, 731 had minor allele frequency 
(MAF) greater than 10%. We also found 657 indels (72 with MAF 
> 10%) in genes identified as essential (Supplementary Information). 
Indels with MAF > 10% predicted to cause frame shifts were enriched 
in the C-terminal 5% of the protein (Fig. 3C, inset). The proportion 
of frame-shift to in-frame indels decreases strongly as a function of 
MAF (Fig. 3d). For example, at MAF >15% there are 1.0 out-of- 
frame indels for every in-frame indel, in comparison with 15.5 at 
MAF <10%. We estimate that 93% (1 — 1.0/15.5) of the 2,949 out- 
of-frame indels with MAF <10% are deleterious. 

All strains were subjected to high-throughput phenotypic analysis 
under multiple conditions (Fig. 4 and Supplementary Fig. 9). Growth 
curves were sampled (>250 time points) over three days and the 
relevant growth variables—lag (adaptation), rate (slope) and effi- 
ciency (maximum density)—were extracted™*, providing roughly 
200 phenotypic traits. The phenotypic variation allowed clustering 
of strains. There is a high qualitative overlap between the phenotypic 
clustering and the phylogenies based on SNPs (Figs 1 and 4). Also, the 
correlation between genotypic and phenotypic similarity within 
S. cerevisiae is surprisingly good (Spearman’s rank test: correlation 
coefficient, 0.30; P= 10° °) given that conventional phenotypic taxo- 
nomy generally fails even to resolve the Saccharomyces sensu stricto 
species. No individual environment determined the overall correlation 
between genotype and phenotype. 

The S. paradoxus strains were well separated from the S. cerevisiae 
strains (Fig. 4), except for the Hawaiian isolate. The phenotypes sepa- 
rating the two species most clearly (P< 10 ”) were strong S. paradoxus 
resistance to cycloheximide and sensitivity to paramomycin, heat and 
copper (Supplementary Fig. 10a). The S. cerevisiae isolates fell into two 
groups (Fig. 4). One contains most of the Wine/European and Sake 
lineages and most of the long-branch recombinants, whereas the other 
mainly consists of the North American, Malaysian and African 
lineages. The main phenotypic characteristic separating these groups 
is rapid growth (short lag and steep slope in rate, P< 10~*) for the 
Wine/European lineages and the mosaics, which could be advantage- 
ous for the fermentation processes in which many of these strains are 
used (Supplementary Fig. 10b). Despite genomic variation, S. para- 
doxus strains (excluding the Hawaiian isolate) show 38% lower pheno- 
typic variation than S. cerevisiae strains (P = 0.002). In S. cerevisiae, the 
phenotypic variance is as high among the clean lineages as among the 
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Figure 4 | Saccharomyces phenotype variation. A selection of growth 
phenotypes for S. cerevisiae and S. paradoxus strains in different 
environments and drugs. The complete set of lag, rate and density 
phenotypes in 67 environments is displayed in Supplementary Fig. 9. 
Phenotypes were quantified using high-resolution micro-cultivation 
measurements of population density. Strain (n = 2) doubling time (rate) 


mosaic lineages (P = 0.78). Hence, the higher phenotypic variance in 
S. cerevisiae is not driven by out-breeding or domestication per se, but 
rather suggests that S. cerevisiae occupies a wider diversity of ecological 
niches than S. paradoxus. 

This survey of S. cerevisiae and S. paradoxus population genomics 
reveals extensive differences in genomic and phenotypic variation 
despite ecological similarities and will allow rapid fine mapping of 
the genetic determinants. Domestication of S. cerevisiae has previously 
been debated’*. Our results could be interpreted in two ways. One is as 
a domestication of one or two groups, the Wine/European and Sake 
strains, with selection for improved fermentation properties. These 
domesticated groups then gave rise to feral and clinical derivatives 
and were involved in the generation of out-crossed derivatives found 
in all sources. The alternative interpretation is that human activity 
simply may have used existing strains from populations that had 
appropriate fermentation properties providing the opportunity to 
out-breed through movement of strains and supplying a novel 
disturbed environment. Using domestication to imply “species bred 
in captivity”, the strains that best fulfil this definition are the baking 
isolates, as they have clearly arisen from crosses between lineages. 
Lineages that were selected from captively bred strains would be 
expected to have lower diversity than other lineages. This is not the 
case for the Wine/European or Sake lineages, which have similar or 
greater levels of diversity in comparison with the other clean lineages or 
to S. paradoxus populations. This view of human activity simply mov- 
ing yeast strains around without captive breeding is consistent with 
analysis of over 600 strains”®. Recent findings in the Malaysian rain- 
forest (from which our three Malaysian S. cerevisiae strains were iso- 
lated) of chronic intake of alcoholic nectar from Bertram palms by wild 
tree shrews suggest that the association of fermented beverages and 
primates is ancient and not exclusive to humans”. 

Beyond the analyses we have presented here, the sequence data we 
have obtained for these strains have many other applications, and have 
already been used both for global’* and gene-specific” studies. With 
the advent of new sequencing technology, it is becoming possible to 
undertake similar population genomic studies for species with much 
larger genomes, including humans”’, enabling a new era of genome 
wide evolutionary and functional genetics. 
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“eRe LSC 


n-fold 


DBVPG6304 
UFRJ50791 
UFRJ50816 
YPS138 
DBVPG6044 
DBVPG1853 
NCYC110 


S2NoD ONS 
OaAN NH 


Cold 16 
Cold 19 
Cold 24 
pH7 
Heat 37 
Heat 40 °C 

Ethanol (stress) 5% 

Ethanol (stress) 7% 

pH 3.5 

pH 2 

Superoxide (paraquat) 250 jig mi? 
Superoxide (paraquat) 400 xg mi 


Cisplatin 150 xg mI 

Cisplatin 100 xg mi" 

MMS 0.01% 

MMS 0.005% 

Anisomycin 0.5 jig mit 

Anisomycin 1 jig mit 

Hydroxyurea 15 mg mi 

Hydroxyurea 10 mg mt 

Tunicamycin 1.5 ug mi 

Tunicamycin 1 jg mit 

Cycloheximide 0.05 jg mit 

Cycloheximide 0.1 xg mr 

Caffeine 1.5 mg mr 

Caffeine 2.25 mg mi 

Paramomycin 10 mg mi 

Paramomycin 15 mg mit 

phenotypes in relation to the $288c derivative BY4741 (n = 20) are 
displayed. Green, poor growth; red, good growth. Hierarchical clustering of 
phenotypes was performed using a centred Pearson correlation metric and 
average linkage mapping. Blue, S. paradoxus; pink, S. cerevisiae; grey, 


S. bayanus isolate CBS7001. LSC, logarithmic strain coefficient. 


METHODS SUMMARY 


Strains to be sequenced were selected to maximise the variety of sources and 
locations of isolation. Except for laboratory strains, a single meiotic diploid spore 
was isolated from the original strain to remove any heterozygosity*. DNA was 
extracted from overnight cultures* for subsequent sequencing on ABI 3730 DNA 
sequencers and an Illumina Genome Analyzer'®. Reference-based genome 
assemblies were created for each strain in a series of steps'*. Each read was aligned 
to the reference genome (S$288c or CBS432). As this approach cannot deal with 
large indels or with sequences not present in the reference genome, we developed 
an iterative parallel-alignment assembling tool, PALAS (Supplementary 
Methods), to introduce insertions that were allowed to share material between 
related strains. Two versions of each strain sequence were produced, a partial 
assembly derived just from data collected from that strain and a more complete 
assembly using an imputation process to infer the most likely sequence of the 
strain taking into account data from related strains. In both cases, confidence 
estimates are given for each base call. The SNPs obtained were used to generate 
neighbour-joining phylogenetic trees'’, infer population structure’’, estimate 
sequence divergence'’ and analyse polymorphisms'®. Non-aligned reads (those 
missing in the reference genome) were searched for potential novel genes. Each 
strain isolate was subjected to precise phenotyping in 67 experimental conditions 
using a high-resolution micro-cultivation Bioscreen C (Oy Growth Curves, 
Finland)**. Two consecutive rounds of 48-h pre-cultivation in synthetic com- 
plete media were followed by a 72-h cultivation in stress media. Readings of 
optical density were taken every 20min. Strains were tested as duplicates 
(N= 2). Growth variables were normalized to the behaviour of the 20 BY4741 
replicates. 

Details of the methods mentioned above are provided in Supplementary 
Information. 
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Comprehensive polymorphism survey elucidates 
population structure of Saccharomyces cerevisiae 


Joseph Schacherer'*+, Joshua A. Shapiro'*, Douglas M. Ruderfer’ & Leonid Kruglyak' 


Comprehensive identification of polymorphisms among indivi- 
duals within a species is essential both for studying the genetic basis 
of phenotypic differences and for elucidating the evolutionary 
history of the species. Large-scale polymorphism surveys have 
recently been reported for human', mouse’ and Arabidopsis 
thaliana’. Here we report a nucleotide-level survey of genomic 
variation in a diverse collection of 63 Saccharomyces cerevisiae 
strains sampled from different ecological niches (beer, bread, vine- 
yards, immunocompromised individuals, various fermentations 
and nature) and from locations on different continents. We hybri- 
dized genomic DNA from each strain to whole-genome tiling 
microarrays and detected 1.89 million single nucleotide poly- 
morphisms, which were grouped into 101,343 distinct segregating 
sites. We also identified 3,985 deletion events of length >200 base 
pairs among the surveyed strains. We analysed the genome-wide 
patterns of nucleotide polymorphism and deletion variants, and 
measured the extent of linkage disequilibrium in S. cerevisiae. 
These results and the polymorphism resource we have generated 
lay the foundation for genome-wide association studies in yeast. 
We also examined the population structure of S. cerevisiae, provid- 
ing support for multiple domestication events as well as insight into 
the origins of pathogenic strains. 

With their small and compact genomes, the hemiascomycetes (the 
group of fungi that includes S. cerevisiae) represent a powerful model 
for comparative genomics and studies of genome evolution*®. As a 
result, more than 18 hemiascomycetes species are either completely 
or partially sequenced. The availability of the sequence data has 
presented an unprecedented opportunity to evaluate DNA sequence 
variation and genome evolution in a phylum spanning a broad evolu- 
tionary range’. This wealth of data on interspecific sequence differ- 
ences stands in contrast to our limited knowledge of sequence 
variation within S. cerevisiae. Because of its importance both to 
human activities and as a model system, we sought to generate a 
comprehensive view of sequence polymorphism in S. cerevisiae. To 
determine sequence variation at the nucleotide level, we hybridized 
genomic DNA from 63 ecologically and geographically diverse 
strains (Supplementary Table 1) to a high-density Affymetrix Yeast 
Tiling Microarray (YTM) and identified positions likely to differ 
from the reference sequence with the software package 
SNPscanner®. We detected a total of 1,896,131 single nucleotide poly- 
morphisms (SNPs) in non-repetitive regions of the genome 
(Supplementary Table 1). Because of variation of up to a few base 
pairs (bp) in the location of SNPs detected by SNPscanner, we used a 
grouping procedure (see Methods) to identify the sites of poly- 
morphic variation across strains. We also removed all singletons 
(SNPs called in only one strain) to reduce false positives further. 
This approach detected a total of 1,299,811 individual SNP calls, 
which were grouped into 101,343 distinct segregating sites. At each 


of these sites, every strain was classified as having either the same or 
different nucleotide relative to the reference strain (S288c). 

We evaluated the coverage and accuracy of our polymorphism 
survey by comparing our data with the low-coverage sequence 
generated by ref. 9; 13 strains are shared between the two data sets. 
Most of the array-called SNPs with sequence data in the region had 
corresponding polymorphisms in the sequence data (median of 92% 
per strain), showing that our data have a low false-positive rate. Array- 
based polymorphism calls captured most (median of 73% per strain) 
of the high-quality (quality score >30), independent (>25 bp from 
the next closest polymorphism) SNPs present in the sequence data, 
showing that our data have high coverage. Discrepancies between 
array-based and sequence-based polymorphism calls probably reflect 
false positives and false negatives in each type of data, and may also 
derive from genuine sequence differences between strains with the 
same name but obtained from different sources by the two studies. 

We detected an average of 30,097 SNPs per strain (Supplementary 
Table 1). Excluding laboratory strains, most of which are closely related 
to the reference strain, the frequency of polymorphisms varied between 
0.0011 to 0.0041 per bp (0.0028 on average), representing an average 
density of 2.8 SNPs per kilobase (kb). Across all strains, we observed 
8.35 non-singleton segregating sites per kb (Qwkb ' = 2.26). The 
frequency spectrum of the observed polymorphisms is highly skewed 
towards an excess of low-frequency alleles, even after corrections for 
the grouping procedure and genotyping errors (Supplementary Fig. 1). 
This excess of rare alleles resulted in a lowered value for the frequency- 
weighted measure of nucleotide diversity (1 kb” ' = 1.92). Some of the 
excess of low-frequency alleles can be attributed to the presence of 
slightly deleterious variants, which are kept at low frequency by nega- 
tive selection but have not yet been purged from the population. We 
expect that mutations in coding regions are more likely to be deleteri- 
ous than those in noncoding regions, resulting in a lower overall level of 
polymorphism in coding regions, and we do observe that coding 
regions are approximately 17% less polymorphic than noncoding 
regions (Table 1). The coding regions also show a larger excess of 
low-frequency polymorphism in their frequency spectrum 
(Supplementary Fig. 2). These trends are further emphasized in the 
1,114 genes known to be essential in the reference strain S288c, which 
show both a lower overall level of polymorphism and a greater skew in 
the frequency spectrum. Noncoding regions are subject to selection on 
regulatory elements. Short intergenic regions should carry a higher 
proportion of functional regulatory sequences than longer noncoding 
regions, and we observe that intergenic regions shorter than 300 bp 
have significantly lower rates of polymorphism than longer regions 
(Supplementary Fig. 3). We found a markedly nonrandom distri- 
bution of polymorphism levels across the genome. We observed a 
decrease in SNP density within 25kb of centromeres (Supple- 
mentary Fig. 4a). This observation is consistent with the lack of 
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Table 1| Functional variation in levels of polymorphism 


Genomic region Number Ow kb? mkb 2 
of segregating sites 

Noncoding regions 33,465 2.69 2.33 

Coding regions 71,420 2.23 1.89 

Paralogues 14,774 2.28 1.93 

Essential genes 13,107 2.08 1.77 


In a standard neutral model 4Nu is the population mutation rate, where N is the effective 
population size and yz the rate of mutation. Aw is Watterson’s theta: an estimate of 4Nu based on 
the number of segregating sites. z is an estimate of 4Ny equal to the mean pairwise difference 
among individuals. 


DNA double-strand breaks (that is, the presence of meiotic recom- 
bination cold spots) near the centromeres"’. By contrast, subtelomeric 
regions, which undergo frequent recombination’, show higher vari- 
ation at the sequence level in the regions 15—45kb from telomeres 
(Supplementary Fig. 4b). 

The genomic extent of linkage disequilibrium—nonrandom asso- 
ciation of alleles at different polymorphic sites—provides informa- 
tion about recombination and population structure, and is also a 
critical parameter for population studies of association between geno- 
type and phenotype. Our data provided the first opportunity to 
measure genome-wide properties of linkage disequilibrium across a 
large collection of diverse strains. We examined pairwise linkage dis- 
equilibrium for the 101,343 segregating sites and found that linkage 
disequilibrium falls to one-half of its maximum value at about 11 kb 
(Fig. 1). Because the yeast genome is physically compact (12 Mb), the 
101,343 segregating sites reported here (nearly a site every 100 bp, of 
which close to one-half have a minor allele frequency > 10%) provide 
a high-density polymorphism resource for S. cerevisiae from which an 
optimized panel of sites sufficient for whole-genome association 
studies in yeast can be chosen. To characterize further the architecture 
of linkage disequilibrium, we examined each of the sampling groups 
that contained at least 10 strains (wine, clinical, distillery and 
laboratory strains; Supplementary Fig. 5). In the wine strains, linkage 
disequilibrium falls to half of its maximum value at ~2.5 kb, but is 
more extensive in clinical (~7 kb), distillery (~9.5 kb) and laboratory 
(~23.8 kb) strains. Because most of the laboratory strains are recently 
derived from the same founder strain S288c"’, linkage disequilibrium 
is expected to be greater than in the other groups. By contrast, the low 
level of linkage disequilibrium in the wine strains probably reflects a 
long time since the most recent common ancestor of these strains, and 
perhaps a higher frequency of outcrossing events. 

To examine structural variation, we identified all deletion 
events >200 bp in the 63 strains (Supplementary Tables 1 and 2). 
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Figure 1| Decay of linkage disequilibrium as a function of distance. 
Averages of pairwise linkage disequilibrium measures r° (black circles) and 
D’ (open circles) are plotted for each bin of distances between pairs of SNPs. 
The linkage disequilibrium values were corrected for finite-size effects by 
subtracting the average value computed for a random subset of pairs of SNPs 
located on different chromosomes. 
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We observed 3,985 deletions (an average of 63 per strain). The num- 
ber of deletions varied from one in BY4716 (which is isogenic to the 
reference but carries an engineered deletion of LYS2) to 106 in 
YJM320. The deletions ranged in size from 200 bp to 13.8 kb, with 
nearly half falling between 200 bp and 400 bp (Supplementary Fig. 6). 

The deletions are unevenly distributed across the genome 
(Supplementary Fig. 7), with enrichment in subtelomeric regions 
(45.4% of events in<10% of the genome; Supplementary Fig. 7b) 
and a deficit near the centromeres (Supplementary Fig. 7a). These 
patterns are consistent with SNP rates and may similarly be explained 
by variation in recombination rates. A total of 254 genes contained a 
whole (119 genes) or partial (135 genes) deletion in at least one strain 
(Supplementary Table 3). Most were deleted in one to four strains, 
but some were deleted in many strains (Supplementary Fig. 8). For 
example, the gene YAR047C is deleted in 59 of the 63 surveyed 
strains. This gene is annotated as a dubious open reading frame 
(ORF) unlikely to encode a protein, on the basis of comparative 
sequence data of Saccharomyces sensu stricto species*. Our obser- 
vation within the S. cerevisiae species strongly supports this hypo- 
thesis. Dubious ORFs accounted for 37 of the gene deletions. The set 
of deleted genes is enriched for those with known functions in trans- 
port, and in particular for sugar and hexose transporters 
(Supplementary Table 4). Most of these deleted genes are located 
in the subtelomeric regions. These results provide clear evidence of 
the importance of variation at subtelomeric regions in adaptation of 
strains to different carbon sources, as previously suggested’. 

We looked for deletions in genes known to be essential in the $288c 
strains'*. We observed partial deletions in only four of the 1,114 
essential genes (KRS1, PGS1, SMT3 and ERG20), many fewer than 
the 49.6 genes that would be expected from the overall deletion 
frequency (x7 = 52; P< 0.0001), which shows that the vast majority 
of the genes defined to be essential in the S288c background are also 
essential in all other genetic backgrounds of S. cerevisiae. With the 
exception of KRS1, these deletions were observed in only a few strains 
(Supplementary Table 3). Moreover, the deletions observed in the 
four essential genes affect a small fraction of the ORF, and the genes 
may still be functional. We examined more closely the partial dele- 
tion in KRS1, which encodes the lysyl-tRNA synthetase. We looked at 
the spore viability from crosses between the S288c reference strain 
and several of the strains (K1, CLIB219, K12 and Y9) in which the 
KRS1 gene is partially deleted (Supplementary Fig. 9). We observed a 
high spore viability of around 90% in each cross, which shows that 
the KRS1 gene is still functional in these strains. We also observed a 
reduced deletion rate in duplicated gene pairs derived from the whole 
genome duplication event (20 observed compared with 49.4 
expected; y = 21.7; P< 0.0001; Supplementary Table 3). 

We sought to use the genome-wide genotypes at the 101,343 poly- 
morphic sites across our diverse collection of strains to elucidate the 
phylogenetic relationships among strains and to evaluate the effects of 
ecological factors and geographical locations on strain diversity. We 
used standard neighbour-joining methods to build a majority-rule 
consensus tree of the surveyed strains (Fig. 2), and also analysed the 
data with the model-based clustering algorithm implemented in the 
program Structure’? (Fig. 3). Both analyses showed at least three dis- 
tinct subgroups based on the source from which the strains were iso- 
lated. Most of the wine strains (with the exception of CLIB219, which 
was isolated in Russia) are members of a single well-defined subpopu- 
lation. Because these wine strains were collected from dispersed loca- 
tions, this observation provides strong evidence of a single 
domestication event of yeast for winemaking, followed by human- 
associated migration of wine yeast all over the world. The wine strains 
show the lowest level of polymorphism among the groups (Table 2), as 
well as an excess of low-frequency SNPs, consistent with a bottleneck 
during domestication. This subpopulation also includes a number of 
strains collected from distilleries, nature (soil, cocoa beans, prickly 
pear and Tuber magnatum) and clinical sources, indicating that these 
strains derived from domesticated wine strains, which transited out of 
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Figure 2 | Neighbour-joining tree of 63 

S. cerevisiae strains. The tree was constructed on 
the basis of the 101,343 segregating sites 
identified in the surveyed strains. Branch lengths 
are proportional to the number of the 101,343 
segregating sites that differentiate each pair of 
strains. Font colour of strain name denotes 
geographical origin and circle colour denotes 
ecological niche as specified in the key. 
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this group to other human-associated fermentations as well as back 
into nature and therefore escaped their human-manufactured envir- 
onment. The second major population group contains the strains used 
for sake production and provides strong evidence for a second and 
independent domestication event, as hypothesized previously'®. The 
laboratory strains, with the exception of SK1, form a third clear group, 
a consequence of the fact that most of the commonly used S. cerevisiae 
strains, with the exception of SK1, are derived from the S288c genetic 
background”. It is worth noting that the EM93 strain, the progenitor 
of S288c originally isolated from a rotting fig’’, is seen to be closely 
related to the laboratory strains. A number of strains did not fall into 
clear groups on the tree and did not cluster into coherent groups in the 
Structure analysis; their genomes seem to be mosaics of contributions 
from the three genetically distinct subgroups. 


Estimated proportion of ancestry 


Clinical 


Although S. cerevisiae is usually considered to be a benign orga- 
nism, there is a growing recognition that it can be a cause of oppor- 
tunistic pathogenic fungal infection, typically, but not exclusively, in 
immunocompromised individuals'*. To investigate the origin of 
these strains, we examined 16 strains isolated from different clinical 
sources (for example, blood, mouth, sputum) in Europe and the 
Americas (Supplementary Table 1). The clinical isolates were broadly 
distributed across the tree, and did not cluster with each other or with 
any one subgroup of strains in the Structure analysis (Figs 2 and 3). 
Three European clinical strains (YJM434, YJM978 and YJM981) were 
closely related to wine strains. Three other European strains from the 
same geographical origin (Newcastle, UK) were closely related to 
each other, and had some similarity to beer and baker strains. The 
remaining ten strains (nine American, one European) branched from 


Figure 3 | Population structure of 
63 S. cerevisiae strains. Cluster 
results from a structure analysis on 
101,343 segregating sites identified 
in the 63 surveyed strains. The 
Structure program implements a 
Bayesian model-based clustering 
algorithm that attempts to identify 
genetically distinct subpopulations 
on the basis of patterns of allele 
frequencies’’. Each strain is 
represented by a single vertical bar, 
which is partitioned into K coloured 
segments that represent the strain’s 
estimated ancestry proportion in 
each of the K clusters. 
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Table 2 | Polymorphism and SNP frequency within groups 


Strains Owkb-* mkb + 
All 2.26 1.92 
Wine 1.38 1.25 
Clinical 2.53 2.54 
Distillery 2.54 2.60 
Nature 2.03 1.97 


See legend to Table 1. 


a similar part of the tree, but did not appear to be closely related to 
each other or to any other coherent group of strain. Our interpreta- 
tion of these results is that clinical isolates do not derive from a 
common ancestor or any one type of strain, but rather represent 
multiple events in which strains present in the environment oppor- 
tunistically colonize human tissues. Our data provide strong 
evidence that wine strains are capable of such colonization, and sug- 
gest that strains from other sources (beer, bakery, laboratory, nature) 
can also do this. These results are consistent with clinical reports of 
patients infected with S. cerevisiae baker’s strains and with the strain 
Saccharomyces boulardii, which is used therapeutically to treat dia- 
rrhoea and is also sold as a probiotic nutritional supplement”. 
Because the main environmental niches for S. cerevisiae in nature 
are not known, clinical strains might represent the best approxi- 
mation of the overall species diversity of S. cerevisiae. 

The polymorphism resource we generated, made freely available in 
the Yeast SNPs Browser database (http://gbrowse.princeton.edu/ 
yeastSNP), enables genome-wide association studies of the pheno- 
typic differences among these and other yeast strains. Phenotypic 
diversity among yeast isolates is significant, and variation is apparent 
among the surveyed strains at different levels. The genetic basis of a 
number of interesting phenotypes can be studied in yeast, including 
growth at high temperature, sporulation efficiency, telomere length, 
gene expression and response to drugs*’*; these studies can now 
move from linkage in crosses between two strains to the population 
level. S. cerevisiae provides a powerful model system for studies of 
complex traits because of the ease with which genetic analyses and 
phenotyping can be carried out and the ability to engineer and test 
the effects of individual polymorphisms and their combinations on 
different genetic backgrounds. 

Our analysis also provides insight into the population structure of 
this yeast species. We show evidence for genetic differentiation of 
three distinct subgroups based on the source from which the strains 
were isolated: vineyards, sake and related fermentations, and labor- 
atory strains. Thus, population structure at least partly reflects dif- 
ferent ecological niches. Surveys of additional strains are needed to 
resolve fully the roles of ecology versus geography in the genetic 
differentiation of this species. Our data strongly support the hypo- 
thesis that these three groups represent separate domestication 
events, and that S. cerevisiae as a whole is not domesticated. 
Finally, our results suggest that S. cerevisiae strains from a range of 
environments are capable of opportunistic colonization of human 
tissues. 


METHODS SUMMARY 


Genomic DNA was extracted from 63 yeast strains (listed in Supplementary 
Table 1) and hybridized to Affymetrix Yeast Tiling Arrays. We used 
SNPscanner* to identify putative SNPs in each of the 63 strains on the basis of 
the hybridization intensity at each probe. Because there is error in the precise 
location of SNP calls made by SNPscanner, we used a grouping procedure 
(described in Methods) in order to integrate SNP calls across strains and 
minimize the effects of erroneous positive and negative calls. 

We constructed a neighbour-joining tree of the 63 strains from the SNP data 
using Splitstree**, with branch lengths proportional to the number of segregating 
sites that differentiate each node. To infer the population ancestry of the strains we 
used Structure’’, with ancestral population numbers between two and six. We 
calculated linkage disequilibrium across the genome using two standard metrics: 
D’ and r’, both for the whole genome and for each subpopulation. We calculated 
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other population genetic summary statistics using code based on the libsequence 
package”*, and performed coalescent simulations of genome evolution using 
FastCoal”’, with corrections for expected error rates and our grouping procedure. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Yeast strains. Yeast strains were obtained from a number of laboratories: J. Fay 
(Washington University), J. Perez-Ortin (University of Valencia), G. Liti and E. 
Louis (The University of Nottingham), J. McCusker (Duke University) and Jean- 
Luc Souciet (Louis-Pasteur University). We also purchased strains from differ- 
ent yeast culture collections: CLIB (Collection de Levures d’Intérét 
Biotechnologique), CBS (Centraalbureau voor Schimmelcultures), DBVPG 
(Dipartimento di Biologia Vegetale e Agroambientale of the University of 
Perugia) and CECT (Coleccion Espanola de Cultivos Tipo). Strains used in this 
study are listed in Supplementary Table 1. 

SNP identification. Yeast strains were grown in yeast extract, peptone and dex- 
trose (YPD) medium. Total genomic DNA was purified from 30 ml YPD culture 
using Qiagen Genomic-Tips 100/G and Genomic DNA buffers as per the manu- 
facturer’s instructions. Genomic DNA was digested with DNasel, labelled and 
hybridized to Affymetrix Yeast Tiling Arrays (YTMs) as described previously* 

We used SNPscanner’ to identify putative SNPs in each of the 63 strains on the 
basis of the hybridization intensity of DNA at each probe. SNPs from each strain 
were independently called against the reference FY3 genome using the following 
parameters: lod score >2, number of probes covering a base >1, and positive 
region length >6. These parameters are further described in ref. 8 and in the 
SNPscanner documentation (http://genomics-pubs.princeton.edu/SNPscanner/ 
). With these parameters, we previously showed, using the complete genome 
sequence of strain YJM789, that 90.1% of true SNPs were detected, with only 
49 false-positive SNP calls over the entire genome (a false-positive rate of 4 X 10° 
per bp). 

Owing to the 4-bp resolution of the YTMs and the variance associated with 
DNA hybridization intensities, the SNP position predicted by SNPscanner may 
fall at varying positions surrounding the actual site of the SNP. This variance 
required us to perform a grouping procedure, combining all the calls within 6 bp 
of each other into a single segregating site. As the average density of putative 
SNPs is 1 per 6.3 bp, the probability of grouping two distinct sites is nontrivial. 
To reduce this probability, we implemented several heuristic filters: first, to 
reduce false positives, we required that at least one of the called SNPs in each 
grouping have a lod score >6. Second, we eliminated possible deletion events by 
removing putative SNPs with large prediction regions (>100 bp). Finally, we 
required at least 9 bp between each SNP in a group and the next closest call in the 
genome. We performed this grouping procedure in a top-down manner, by first 
grouping the SNPs with the most calls at a given position. 

We tested the accuracy of this grouping procedure using a set of known high- 
confidence SNPs from the completely sequenced genomes of the strains $288c, 
RMI1-la and YJM145. Specifically, we examined 13,839 SNPs for which 
YJM145 and RM11-la had the same allele and differed from the reference 
sequence. For this set, 12,578 (91%) and 11,518 (83%) SNPs were detected 
before grouping in YJM145 and RM11-1a, respectively. After grouping, 9,119 
SNPs were detected in at least one strain, and 8,086 were correctly called in both 
strains, from which we infer a false-negative rate of 5.7% per strain, given detec- 
tion in at least one strain. The grouping procedure almost never separated the 
same site into multiple sites (one case across the genome), and rarely combined 
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two distinct sites (394 cases; <<5% of sites after grouping). These cases are typi- 
cally SNPs that are located within 4 bp of each other, closer than the theoretical 
resolution of the YTMs. We also removed all singletons (SNPs called in only one 
strain) to reduce false positives further. 

Tree building and Structure analysis. We constructed a neighbour-joining tree 
of the 63 strains from the SNP data using the software package Splitstree”’, with 
branch lengths proportional to the number of segregating sites that differentiate 
each node. We ran Structure using the linkage model with the population num- 
ber parameter, K, set from 2 to 6, for 100,000 iterations after a burn-in of 100,000 
iterations, the first 50,000 of which were run under the free-recombination 
model’’. 

Linkage disequilibrium. We calculated linkage disequilibrium across the gen- 
ome using two standard metrics: D’ and 7°. We computed these statistics for all 
pairs of sites located within a given distance, both for all the strains and within 
each predefined subpopulation. To correct for finite-size effects and differences 
in sample size among the subpopulations, we subtracted from each statistic the 
average value for a random subset of SNP pairs located on different chromo- 
somes (which should not show linkage disequilibrium). 

Polymorphism and divergence statistics. We calculated population genetic 
summary statistics of polymorphism using code based on the libsequence pack- 
age*®. To correct for the removal of singleton SNPs in the data set, modified 
estimators of the population mutation parameters Ow and 7 were used**. An 
analogue of Tajima’s D was calculated as the difference between these modified 
estimates’. To obtain significance values, we simulated under a modified coales- 
cent model as described below, conditioning on the observed number of segreg- 
ating sites and the approximate length of the sequence. The significance of an 
observed statistic was then taken to be the probability of observing a more 
extreme value in at least 10,000 simulations. Divergence rates were calculated 
from the multiple species alignments of ref. 4. We used PAML” to obtain 
maximum likelihood estimates of the rate of evolution along the S. cerevisiae 
branch after divergence from Saccharomyces paradoxus. 

Coalescent simulations of genome evolution were performed using 
FastCoal’’. Output from each coalescent simulation was run through a series 
of steps to mirror the sources of error inherent in the SNPscanner data. First, 
called SNPs were randomly removed with a probability of 5%. The addition of 
randomly missed calls creates a characteristic dearth of high-frequency SNPs in 
the data set; simulations under a 5% false-negative rate fit very closely with the 
observed pattern of polymorphism at high frequency. To correct for incorrectly 
grouped SNPs, we performed the previously described grouping procedure on 
all simulated data. 
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Antiviral immunity in Drosophila requires systemic 


RNA interference spread 


Maria-Carla Saleh't, Michel Tassetto’*, Ronald P. van Rij'*+, Bertsy Goic”, Valérie Gausson’, Bassam Berry”, 
Caroline Jacquier®, Christophe Antoniewski°? & Raul Andino' 


Multicellular organisms evolved sophisticated defence systems to 
confer protection against pathogens. An important characteristic 
of these immune systems is their ability to act both locally at the site 
of infection and at distal uninfected locations“. In insects, such as 
Drosophila melanogaster, RNA interference (RNAi) mediates anti- 
viral immunity’’. However, the antiviral RNAi defence in flies 
seems to be a local, cell-autonomous process, as flies are thought 
to be unable to generate a systemic RNAi response®. Here we show 
that a recently defined double-stranded RNA (dsRNA) uptake path- 
way’ is essential for effective antiviral RNAi immunity in adult flies. 
Mutant flies defective in this dsRNA uptake pathway were hyper- 
sensitive to infection with Drosophila C virus and Sindbis virus. 
Mortality in dsRNA-uptake-defective flies was accompanied by 
100-to 10°-fold increases in viral titres and higher levels of viral 
RNA. Furthermore, inoculating naked dsRNA into flies elicited a 
sequence-specific antiviral immune response that required an 
intact dsRNA uptake pathway. These findings suggest that spread 
of dsRNA to uninfected sites is essential for effective antiviral 
immunity. Notably, infection with green fluorescent protein 
(GFP)-tagged Sindbis virus suppressed expression of host-encoded 
GFP at a distal site. Thus, similar to protein-based immunity in 
vertebrates, the antiviral RNAi response in flies also relies on the 
systemic spread of a virus-specific immunity signal. 

On the basis of the recent identification of a dsRNA uptake path- 
way in flies”'®, we hypothesized that dsRNA produced and released 
from infected cells can be taken up locally, and perhaps at distal sites, 
to establish systemic pre-existing immunity in uninfected cells 
(Fig. 1). We thus examined whether naked dsRNA can mediate sys- 
temic RNAi spread by inoculating flies with dsRNA corresponding to 
two different regions of the Sindbis virus genome (dsSin1 and dsSin2, 
Supplementary Fig. la; see also Fig. 2a). Two days after dsRNA 
inoculation, flies were infected with a recombinant Sindbis virus 
expressing GFP (Sindbis-GFP virus, Supplementary Fig. 1a). 
Notably, inoculation with dsSin1 and dsSin2 markedly reduced accu- 
mulation of GFP as determined by fluorescence microscopy and 
immunoblotting (Fig. 2b, c, lanes 7-11 and 18-22); control buffer 
had no effect on virus replication (Fig. 2b, c, lanes 2-6 and Supple- 
mentary Fig. 1b). This inhibitory response was sequence specific 
because flies inoculated with dsRNA corresponding to Drosophila 
C virus (DCV) genome showed no effect on Sindbis virus replication 
(Fig. 2b, c, lanes 13-17). Furthermore, inoculation of dsRNA corres- 
ponding to DCV (dsDCV) efficiently protected wild-type flies against 
DCV infection, but not against Sindbis virus (Supplementary Fig. 2a). 
The antiviral effect of exogenous dsRNA inoculation required a func- 
tional RNAi machinery as Dicer 2 and Argonaute 2 null mutant flies 


(Der2~/~ and Ago2 ’ ~ ) were unable to mount an effective antiviral 
response (Fig. 2d and Supplementary Fig. 2a). In addition, wild-type 
flies accumulated short interfering RNAs (siRNAs) derived from 
injected dsRNA (Supplementary Fig. 2c). We conclude that inocu- 
lation of dsRNA initiates a bona fide, specific RNAi response that 
protects flies against virus infection. 

Serial dilutions of dsSin2 indicated that very low concentrations of 
injected dsRNA sufficed to mount a very strong response (Fig. 2e). 
Accordingly, we observed reduced viral replication even after inocu- 
lation of 5 pg of dsRNA (equivalent to 1.5 X 10° molecules of dsSin2, 
Fig. 2e, lanes 17-20). Of note, whereas the maximal dose of dsSin2 
(5 ng) elicited an inhibitory response that lasted 5 days (Fig. 2e, lanes 
5-8), inoculation of a lower dose produced a shorter period of 
immunity (Fig. 2e, compare lanes 5—8 with 9-12, 13-16 and 17-20). 
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Figure 1| Model for systemic RNAi viral immunity in Drosophila 
melanogaster. Upon viral infection, virus-specific dsRNAs (for example, 
replication intermediates) are generated during the initial rounds of virus 
replication. After cell death or lysis, dsRNAs are taken up and processed by 
uninfected cells to protect them from subsequent infection, thereby 
preventing virus spread. 
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Figure 2 | In vivo dsRNA immunization provides sequence-specific 
antiviral protection in D. melanogaster. a, Immunization protocol. 

b, c, Wild-type flies infected with Sindbis—GFP virus two days after 
intrathoracic injection of dsRNA against Drosophila C virus (DCV, 442 bp 
long, corresponding to the viral polymerase between nucleotides 
5589-6030), dsRNA against Sindbis virus non-structural proteins 1 and 2 
(dsSin1, 901 bp long, corresponding to nucleotides 1211-2112) or dsRNA 
against Sindbis virus corresponding to the non-structural proteins 3 and 4 
(dsSin2, 954 bp long, corresponding to nucleotides 5485-6439). Buffer, 


This observation underscores the efficiency and persistence of the 
dsRNA-mediated antiviral immunity in Drosophila, and supports 
the idea that exogenous dsRNA can initiate an RNA silencing response 
in flies, albeit without the RNAi amplification mechanism observed in 
plants and nematodes’. 

We previously described that dsRNA is taken up in Drosophila S2 
cells by an active pathway, involving receptor-mediated endocytosis’. 
To examine whether this pathway is involved in the antiviral RNAi 
response mechanism we selected three genes implicated by the pre- 
vious analysis in dsRNA uptake: egghead (egh), encoding a seven- 
transmembrane-domain glycosyltransferase; ninaC, coding for a 
protein involved in vesicle transport; and a gene of unknown func- 
tion, CG4572 (Supplementary Fig. 2b). Although viability and ferti- 
lity of homozygous egh"’®"*, ninaC’ and CG4572°° mutant flies 
did not differ significantly from wild type, all three mutants were 
hypersensitive to DCV or Sindbis virus infection. In these dsRNA 
uptake-defective flies, we observed an earlier onset of disease (Fig. 3a, 
b). After infection, median survival of homozygous egh’?®"4, ninaC? 
and CG4572* flies was approximately 5-8 days, compared with 
more than 14 days in wild-type flies, and the 50% lethal dose (LD5o) 
in CG4572”® flies was ninefold lower than in wild-type flies (not 
shown). 

An important consideration when studying viral sensitivity in ani- 
mals defective for components of a major cellular pathway, such as 
endocytosis or intracellular transport, is that enhanced death after 
viral infection may be caused by a decrease in fitness or general health 
of the mutant animal, and not by a direct antiviral activity of the 
deleted component. To establish whether the increased mortality of 
egh, ninaC and CG4572 mutant flies stems from their inability to 
control virus replication, we determined viral loads (Fig. 3c). Even 
at early time points after infection, before the onset of disease, DCV 
titres were 100- to 10°-fold higher in homozygous egh"’®"*, ninaC? 
and CG4572"® flies compared to wild-type controls (Fig. 3c). The 
increase in viral titres in mutant flies was mirrored by a marked 
increase in viral RNA levels. Whereas viral RNA was barely detected 
in wild-type flies before day 5, it was clearly observed at 24h after 
infection in homozygous ninaC’ and CG4572* mutant flies, and 
by 48 h it accumulated at much higher levels in these mutants than in 
wild-type flies (Supplementary Fig. 3a). We examined the role of the 
dsRNA uptake pathway on virus replication further by monitoring 
Sindbis—GFP virus tissue tropism. In wild-type flies, GFP fluor- 
escence was barely detected 3 days after infection and accumulated 
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control injection; d.p.i., days post infection. Sindbis—GFP virus replication 
was monitored by GFP production. b, Fluorescence images. ¢, Western blot 
with an anti-GFP antibody. d, Sindbis—GFP virus challenge in wild-type, 
homozygous Der2"*!!** (Der2~'~) and homozygous Ago2*"* (Ago2 ‘~) 
flies. e, dsRNA immunization protects in a dose-dependent manner. Flies 
were inoculated with dsRNA (dsSin2) directed against Sindbis—GFP. Virus 
replication over time (2-5 d.p.i.) was monitored by western blotting using an 
anti-GFP antibody. 


in discrete puncta throughout the fly. In contrast, in homozygous 
egh* P04 ninaC’ and CG4572°* mutants, GFP accumulated within 
a large structure in the abdomen of the animal and at much higher 
levels than in wild-type flies (Fig. 3d and Supplementary Fig. 3b). 
These results indicate that the enhanced viral susceptibility of egh, 
ninaC and CG4572 mutant flies is due to their inability to control 
virus replication. 
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Figure 3 | Increased viral susceptibility of dsRNA-uptake-deficient 
mutants. a, b, Survival of dsRNA uptake mutant flies after virus infection. 
Homozygous egh"’®” (egh ‘~), ninaC’ (ninaC '), CG4572°°°° 
(CG4572 '~) and wild-type flies were injected with 500 TCIDso (50% tissue 
culture infectious dose) DCV (a) or 500 plaque-forming units Sindbis-GFP 
virus (b) and monitored daily for survival. c, DCV replicates at higher levels 
in dsRNA uptake mutant flies. Flies were injected with 500 TCID5) DCV, and 
virus production was monitored over time. At each time point, three pools of 
five flies were homogenized, and the viral titre in the homogenate was 
determined by end-point dilution. The error bars report the average + s.d. 
for at least three independent experiments. d, Sindbis—GFP virus replicates 
at higher levels in dsRNA uptake mutant flies as shown by increased GFP 
expression in the fat body at 3 days after infection when compared with wild- 


type flies. 
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The hyper-susceptibility to virus infection of flies defective in 
dsRNA uptake is strikingly similar to that previously seen in Ago2- 
defective flies®. Therefore, we examined whether the RNAi core func- 
tion is impaired in egh, ninaC and CG4572 mutant flies. Eye-specific 
silencing of the Ecdysone receptor gene (EcR) by an endogenously 
expressed EcR hairpin dsRNA” leads to abnormal eye structure 
resulting from impaired corneal lens formation (Fig. 4a, b). Under 
these conditions, disruption of the core RNAi machinery in homo- 
zygous Ago2*'* mutant flies suppressed EcR RNAi and restored 
normal eye structure. In contrast, efficient EcR RNAi was observed 
in homozygous egh’?®"4 and CG4572°"* flies. Similar experiments 
monitored RNAi in homozygous ninaC’ flies using the expression of 
a hairpin dsRNA targeting the white gene that causes a decrease of eye 
pigmentation and orange eye colour in control flies (Supplementary 
Fig. 4a, b)'*. Silencing of white was suppressed in homozygous 
Der2'*'* mutant flies whereas it was fully maintained in homo- 
zygous ninaC’ flies (Fig. 4c and Supplementary Fig. 4a, b). We further 
confirmed this conclusion by injecting dsRNA against the fushi tarazu 
gene (fiz)’* into syncytial embryos before cellularization. Injection of 
ftz dsRNA in wild-type embryos resulted in the expected segmenta- 
tion defects, namely loss of denticle belts in the cuticle of pre- 
hatching larvae (ftz phenotype; Supplementary Fig. 4c). Injection 
of ftz dsRNA in homozygous egh”’*"*, ninaC’ and CG4572°°” 
embryos induced the same defects, indicating that RNA silencing 
proceeded normally in these mutants. In contrast, homozygous 
Ago2*'* control embryos were unable to silence ftz expression and 
thus hatched with a wild-type cuticle (Supplementary Fig. 4c). These 
results indicate that mutant flies support efficient RNAi silencing if 
dsRNA uptake is bypassed through expression of dsRNA hairpins 
intracellularly or by injecting dsRNA into syncytial embryos. 
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Figure 4 | Core RNAi machinery and antibacterial immunity are intact in 
dsRNA uptake mutants. a, Schematic to test the core RNAi machinery 
integrality. b, RNAi processing of an inverted repeat IR [EcR] induced by the 
GMR-GAL4 driver prevents the formation of the corneal lens (electron 
microscopy picture, top panels are X200 original magnification, bottom 
panels are X500 original magnification). c, Monitoring corneal lens 
formation and eye colour in transgenic flies deficient in the dsRNA uptake 
pathway. d, Susceptibility of dsRNA uptake mutant flies to infection is 
specific to the viruses, as the dsRNA uptake mutant flies are able to produce 
antimicrobial peptides in response to an infection by pathogenic Gram- 
positive and Gram-negative bacteria. 
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We next examined whether other arms of the immune system were 
affected in egh, ninaC and CG4572 mutants. Insects produce a num- 
ber of antimicrobial peptides, which are secreted into the haemo- 
lymph, in response to immune challenge. These peptides are effective 
against Gram-negative and Gram-positive bacteria as well as fungi’. 
We determined whether egh, ninaC and CG4572 mutant flies can 
support production of the antimicrobial peptides drosomycin and 
diptericin in response to septic injury with Gram-positive and Gram- 
negative bacteria. Drosomycin production was measured after septic 
injury with Micrococcus luteus, a Gram-positive bacterium that sig- 
nals through the Toll pathway. Production of diptericin was mea- 
sured after septic injury with Erwinia carotovora (also called 
Pectobacterium carotovorum), a Gram-negative bacterium that 
induces the Imd pathway. Homozygous egh?®°4, ninaC’ and 
CG4572°"* flies were able to respond efficiently to bacterial infec- 
tion, (Fig. 4d). Similarly, the JAK/STAT signalling pathway’ seems 
to be unimpaired in egh’?®"* and CG4572”*® flies as DCV infection 
induced normal vir-1 expression in these mutants (Supplementary 
Fig. 5). Thus, defects in cellular components that abrogate dsRNA 
uptake and its ensuing antiviral immunity do not generally impair 
other arms of the fly innate immune system. 

We hypothesize that the dsRNA uptake pathway underlies sys- 
temic antiviral immunity, which is required to control virus replica- 
tion. We thus examined whether dsRNA inoculation in egh, ninaC 
and CG4572 mutant flies was able to elicit the protective immunity 
observed in wild-type flies (Fig. 2). Indeed, whereas inoculation of 
DCV dsRNA markedly reduced DCV replication in wild-type flies 
(Fig. 5a, lanes 1-9), homozygous egh’?®™, ninaC’ and CG4572° 
mutant flies were unable to mount an antiviral response on DCV 
dsRNA inoculation (Fig. 5a, lanes 10-30). Similarly, the dsRNA 
uptake pathway was required for protection against Sindbis virus 
infection by naked dsSin1 inoculation (Supplementary Fig. 6). 
Furthermore, whereas wild-type flies efficiently processed inoculated 
dsRNA into siRNAs, egh* Ps ninaC’ and CG4572°° mutant flies 
accumulated siRNAs at much lower levels (Supplementary Fig. 2b). 

Our model states that infected cells release viral dsRNA that is 
subsequently taken up by uninfected cells through the dsRNA uptake 
pathway, thereby eliciting an antiviral RNAi response. A direct pre- 
diction of this model is that during infection, viral-derived dsRNA 
spreads to induce systemic silencing. To test this prediction we exam- 
ined whether infection with a Sindbis virus carrying the GFP gene 
could silence a ubiquitously expressed endogenous GFP at a distal 
site. After intrathoracic inoculation, Sindbis—GFP virus RNA was 
readily detected in the thorax and abdomen of Tub-eGFP transgenic 
flies starting at 1 day after infection (Fig. 5b, lanes 5-8). In contrast, 
the viral RNA was not detectable in the head until day 5 after infec- 
tion (Fig. 5b, lanes 1-4). Notably, endogenous GFP expression in the 
head was significantly reduced already at day 2, despite the absence of 
any detectable viral replication in this organ (Fig. 5c, lane 3). In 
contrast, infection with control Sindbis virus carrying a firefly luci- 
ferase gene did not silence GFP expression (Fig. 5d). These results 
indicate that a virus-specific derived RNAi signal spreads from the 
thorax to the head early after infection. 

It was previously thought that Drosophila is unable to spread sys- 
temically an RNAi response, based on observations that endogenously 
expressed RNA hairpins do not spread from cell to cell’. However, we 
demonstrate that, upon virus infection, infected cells spread systemi- 
cally a silencing signal that elicits protective RNAi-dependent immu- 
nity throughout the organism. Although uninfected Drosophila cells 
seem to lack a constitutive mechanism for systemic RNAi spread, 
unlike plant and nematode cells, they do have an active and highly 
efficient mechanism for dsRNA uptake, which we here show is essential 
for antiviral immunity. Accordingly, dsRNA is normally not released 
from uninfected cells, but virus infection may induce dsRNA release 
either through lysis of infected cells or through a virally induced shed- 
ding mechanism. We propose that these virally derived dsRNAs are 
taken up into uninfected cells to generate virus-specific intracellular 
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Figure 5 | Systemic spread of dsRNA follows virus infection and is 
essential for effective antiviral immunity. a, Drosophila C virus infection in 
wild-type flies, and in homozygous egh"?® (egh”'~), ninaC? (ninaC '~) 
and CG4572°>? (CG4572~'~) mutant flies inoculated with the indicated 
dsRNA. DCV replication was monitored by western blotting using an 
antibody directed against DCV capsid protein VP1. b—-d, dsRNA produced 
during virus replication can spread and silence endogenous GFP expressed 


immunity that prevents virus spread (Fig. 1). In support of this idea, 
this specific antiviral response in flies requires both the RNAi core 
machinery and the recently described dsRNA uptake pathway. 
Furthermore, simple inoculation of even very low amounts of 
dsRNA, in the absence of virus infection, can by itself promote a potent 
antiviral immunity, which is similarly dependent on the RNAi core 
machinery and the dsRNA uptake pathway. Our previous results indi- 
cated that whereas dsRNA is readily taken up by Drosophila S2 cells, 
siRNAs are not efficiently taken up’. We thus conclude that systemic 
spread of a specific antiviral RNAi activity—probably mediated by 
large viral dsRNAs or intramolecular base-pairing structures released 
from infected cells—is an essential component of the immune response 
elicited by virus infection in flies. The precise nature of the RNAi spread 
intermediate remains to be defined further. 

It is remarkable that blocking the spread of the RNAi signal has 
such a profound effect on antiviral immunity. This suggests that the 
cell-autonomous RNAi response is insufficient to control a viral 
infection. In striking parallel to vertebrates, flies also rely on systemic 
immunity, albeit in this case the virus-specific signal is dsRNA-based. 
These observations provide an insight into the evolutionarily con- 
served principles of immunity in multicellular organisms, requiring 
both cell-autonomous responses as well as systemic mechanisms to 
create pre-existing immunity to protect uninfected cells. 


METHODS SUMMARY 

Fly stocks. dsRNA uptake mutant stocks were obtained from the Bloomington 
Drosophila Stock Center. The genomic structure of these mutant allele stocks was 
confirmed by inverse PCR and sequencing. The egh*’®”* allele is a P-element 
insertion in the coding sequence of the egh first exon. The ninaC’ allele is a 
replacement of the K1078 codon by a stop codon'*. The CG4572® allele is 
a PiggyBac insertion in the open reading frame of CG4572°°. The 
UAS>IR[EcR] transgene producing EcR dsRNA’ and the P{GAL4- 
ninaE.GMR}12 GAL4 driver were recombined on chromosome 2 before genetic 
crosses with egh®P804 and CG4572"* mutant stocks. The GMR>IR[ white] 
inverted repeat transgene has been previously described'*. The Tub-eGFP trans- 
genic line was obtained from S. Cohen”. 

dsRNA preparation and injection into adult flies. dsRNA was generated by in 
vitro transcription using T7 RNA polymerase. Five-day-old female flies were 
injected in the thorax using a nanoinjector (Nanoject IH, Drummond 
Scientific). Two days later flies were infected in the opposite side of the thorax 
with the appropriate virus. 

Viruses. Recombinant Sindbis virus expressing GFP during viral replication was 
generated by cloning enhanced GFP into the Xbal site of the double subgenomic 
Sindbis vector pTE3’2J (provided by C. Rice)’®. In vitro transcribed RNA was 
transfected into BHK-21 cells. Virus titre was determined by plaque assay on 
BHK cells. DCV stock preparation and viral titre calculation have been previ- 
ously described’. 
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at a distal site of infection. Flies expressing eGFP (Tub-eGFP) inoculated 
with Sindbis—GFP (b, ¢) or Sindbis—luciferase virus (d) by intrathoracic 
inoculation. b, Viral replication monitored by RT-PCR using primers that 
amplify NSP1/2 virus genes. c, Expression of endogenous GFP was 
monitored by western blot with an anti-GFP antibody. d, Same as ¢ except 
that flies were infected with Sindbis—luciferase virus. 


Microbial infection. Log phase of growth cultures of bacteria Erwinia carotovora 
and Micrococcus luteus were re-suspended in culture medium, and sharpened 
needles dipped into these suspensions. Flies were harvested at 6 and 36h after 
septic injury. RNA extraction and northern blots were performed following 
standard procedures. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cells, plasmids and viruses. Drosophila S2 cells (Invitrogen) were cultured at 
25 °C in Schneider’s Drosophila medium (Gibco), supplemented with 10% heat- 
inactivated fetal calf serum, 2mM t-glutamine, 100U ml! penicillin, and 
100 mg ml! streptomycin. Firefly (Photinus pyralis) and Renilla reniformis luci- 
ferase sequences from the plasmids pGL3 and pRL-CMV (Promega) were cloned 
into pMT/V5-HisB (Invitrogen), generating pMT-Luc and pMT-Ren allowing 
copper-inducible expression from a metallothionein promoter. 

Transfections were performed using Effectene transfection reagent (Qiagen) 
according to the manufacturer’s recommendations. Luciferase expression was 
assayed using the Dual-Luciferase Reporter Assay System (Promega) and analysed 
ona Tecan Ultra-evolution plate reader. Double-stranded RNA was generated by 
in vitro transcription from T7-promoter-flanked PCR products. DCV viral stocks 
were prepared on low-passage S2 cells and titred by end-point dilution. Briefly, 
25,000 S2 cells per well in a 96-well plate were inoculated with tenfold dilutions of 
viral stocks. Cells were transferred to fresh medium at day 7 and cytopathic effect 
was monitored visually over 14 days. Viral titres were calculated according to the 
method of Reed and Muench”. 

Recombinant Sindbis virus expressing GFP during viral replication was gener- 
ated by cloning enhanced GFP into the XbalI site of the double subgenomic Sindbis 
vector pTE3’2J (provided by C. Rice)’. The resulting plasmid was linearized and in 
vitro transcribed using the mMessage machine kit (Ambion). RNA was purified 
and electroporated into BHK-21 cells, and supernatant was harvested and virus 
titre determined by plaque assay on BHK cells. 

RNAi in S82 cells. The effect of downregulating NinaC, CG4572 and Egghead on 
dsRNA uptake was analysed in a silencing of luciferase expression assay. Cells 
were pre-treated with approximately 500-nucleotide-long dsRNA targeting egh 
(nucleotides 488-1103; 616-bp product), ninaC (nucleotides 161-761; 601-bp 
product), cg4572 (nucleotides 61-731; 671-bp product), or Ago2 (nucleotides 
214-865; 652-bp product), or with dsRNA targeting GFP as a negative control. 
Three days after knockdown of these gene products, the cells were co-transfected 
with an RNAi dual reporter system, consisting of firefly luciferase and Renilla 
luciferase expression plasmids. Then, dsRNA directed against firefly luciferase 
(nucleotides 66-658; 592-bp product) was either added to the culture superna- 
tant (soaking) or directly introduced into cells by co-transfection with the dual 
reporter plasmids (transfection). Twenty-four hours after dsRNA luciferase 
treatment, expression of luciferase was induced by adding CuSO, to the culture 
supernatant, and cell lysates were generated after an additional 18 h incubation. 
Microbial infection. The bacteria Erwinia carotovora and Micrococcus luteus 
were pre-cultured in LB medium. Pellets taken when the cultures were in the 
log phase of growth were re-suspended in a small amount of culture medium, 
and sharpened needles dipped into these suspensions. Flies were harvested at 6 
and 36h after septic injury. Total RNA extraction and northern blots were 
performed following standard procedures. 

Fly infections. Flies were reared on standard medium at 25°C. Ago2*"* and 
Der2'8"'®* flies have been described previously”. w'’* flies were used as wild- 
type controls. Two-to-three-day-old female flies were injected with 50 nl of the 
appropriate virus dilution in 10 mM Tris-Cl (pH 7.5) as described previously”, 
using a Drummond nanoject injector. Fly mortality at day one was attributed to 
damage invoked by the injection procedure, and these flies were excluded from 
further analyses. Mortality was monitored daily for 14 days, and every three to 
four days the flies were transferred to fresh food. In all experiments 40-60 flies per 
genotypic group were injected. Unless noted otherwise, female flies were used. No 
significant difference in survival was observed between flies after injection of 
buffer (data not shown). For northern blots, RNA was isolated from 25 flies using 
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Trizol reagent. Viral titres in the flies were determined by end-point dilution of fly 
homogenate of three pools of five flies. At the indicated time points, flies were 
harvested and stored at —70°C until further processing. We confirmed the 
absence of endogenous virus in fly stocks by titration of uninfected fly homo- 
genate on S2 cells. 

dsRNA preparation and injection into adult flies. dsRNA was generated by in 
vitro transcription using T7 RNA polymerase using as template PCR products 
corresponding to nucleotides 1211-2112 (NSP1/2) and 5485-6439 (NSP3/4) of 
the Sindbis virus genome, or nucleotides 5589-6030 of the DCV genome. Five- 
day-old female flies were CO2-anaesthetized and injected in the thorax with 50 nl 
of the appropriate dsRNA (Smgml') using a nanoinjector (Nanoject II, 
Drummond Scientific). Two days later flies were CO-anaesthetized and injected 
in the opposite side of the thorax with the appropriate virus dilution in 10 mM 
Tris-Cl (pH 7.5). Injection of the same volume of 10 mM Tris-HCl, pH 7.5, was 
used as a control. Age of the flies and amount of dsRNA injected was determined 
according to ref. 24. Virus infection has been described previously’. 

Western blot analysis. For protein analysis, equal amounts of protein from total 
fly extracts were boiled in Laemmli buffer and loaded on 10% SDS-PAGE. After 
transfer nitrocellulose membranes were blocked in 5% milk, 1X PBS, 0.1% 
Tween, and incubated overnight with rabbit polyclonal anti-GFP (Santa Cruz 
Biotechnology) or rabbit polyclonal anti-VP1 (custom made). For normaliza- 
tion a monoclonal antibody anti-«-tubulin (Sigma Aldrich) was used. Detection 
was performed using Supersignal West Pico Chemiluminescent Substrate 
(Pierce). 

Northern blots. Total RNAs were extracted from whole flies using Trizol 
(Invitrogen). 15 jg of total RNA was size fractionated on 1% (w/v) agarose gels 
containing 1.1mM formaldehyde. After electrophoresis, the RNA was trans- 
ferred overnight by capillarity to a nylon membrane (Nytran Supercharge; 
Schleicher and Schuell) and covalently bound to the membrane using a 
Stratalinker UV crosslinker. Northern blots were hybridized with DNA probes 
generated by a random-primed labelling reaction and [a.-**P]dCTP. Membranes 
were exposed overnight to a PhosphorImager screen at room temperature. Viral 
RNA was detected by northern blot using standard procedures with a random 
primed DNA probe corresponding to nucleotides 1947-2528 of DCV. 
Oligonucleotide primers. All the primers used to produce dsRNA had a T7 
promoter sequence (TAATACGACTCACTATAGGGAGA) at the 5’ end. 
DCVpol forward, 5'-CAACGAATATGTCGCCTTGA-3’; DCVpol reverse, 
5'-TTGGTTGTACGTCAAAATCTGAG-3’; SINnsp1 forward, 5’-TCTGCCGA- 
TCATAGCACAAG-3’; SINnsp2 reverse, 5’-CCTTCTTAACGCAACGCTTC-3’; 
SINnsp3 forward, 5’-GAGGATCAATTTTCGACGGAGA-3’; SINnsp4 reverse, 
5'-GATTGAATGTCGCTGAGTCCAG-3’; vir-1 forward, 5'-TTCGATTCCTC- 
AGACGATGA-3’; vir-1 reverse, 5'-GGTCAATGGGCACAAAGTTC-3'; Rp49 
forward, 5’-AAGGGTATCGACAACAGAGTGC-3’; Rp49 reverse, 5’-ACAA- 
ATGTGTATTCCGACCACG-3’. 
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Regulatory T-cell suppressor program co-opts 
transcription factor IRF4 to control T,2 responses 


Ye Zheng’*+, Ashutosh Chaudhry’”’+, Arnold Kas”, Paul deRoos’”+, Jeong M. Kim’”, Tin-Tin Chu’”, 
Lynn Corcoran’, Piper Treuting’, Ulf Klein? & Alexander Y. Rudensky’*+ 


In the course of infection or autoimmunity, particular transcrip- 
tion factors orchestrate the differentiation of Ty1, T}q2 or Ty17 
effector cells, the responses of which are limited by a distinct 
lineage of suppressive regulatory T cells (Tyeg). Treg cell differenti- 
ation and function are guided by the transcription factor Foxp3, 
and their deficiency due to mutations in Foxp3 results in aggress- 
ive fatal autoimmune disease associated with sharply augmented 
Ty1 and Ty2 cytokine production’”’. Recent studies suggested that 
Foxp3 regulates the bulk of the Foxp3-dependent transcriptional 
program indirectly through a set of transcriptional regulators ser- 
ving as direct Foxp3 targets*’. Here we show that in mouse T,.g 
cells, high amounts of interferon regulatory factor-4 (IRF4), a 
transcription factor essential for T};2 effector cell differentiation, 
is dependent on Foxp3 expression. We proposed that IRF4 
expression endows T;eg cells with the ability to suppress T}2 res- 
ponses. Indeed, ablation of a conditional Irf4 allele in T,., cells 
resulted in selective dysregulation of T};2 responses, IL4-depend- 
ent immunoglobulin isotype production, and tissue lesions with 
pronounced plasma cell infiltration, in contrast to the mononuc- 
lear-cell-dominated pathology typical of mice lacking T,,.. cells. 
Our results indicate that T,., cells use components of the tran- 
scriptional machinery, promoting a particular type of effector 
CD4* T cell differentiation, to efficiently restrain the correspond- 
ing type of the immune response. 

Treg Cell deficiency results in activation and expansion of CD4* 
and CD8* T cells, dendritic cells, granulocytes and macrophages, and 
greatly increased production of a wide range of cytokines including 
interleukin (IL)-2, Ty1 and T}2 cytokines®’. Expression of Foxp3 is 
required for the establishment and maintenance of Tg lineage iden- 
tity and suppressor function*”’. Our recent study suggested that in 
Treg cells Foxp3 might regulate expression of IRF4 (refs 12-14) a 
transcription factor that is indispensable for T},2 effector cell differ- 
entiation'»'®. Furthermore, a recent study suggested a prominent role 
for IRF4 in Ty17 differentiation’’. Thus, we decided to examine a role 
for IRF4 in Tyeg cell differentiation and function. 

Foxp3 binding within the promoter region of Irf4 in T,.. cells* was 
confirmed by chromatin immunoprecipitation (ChIP)-coupled 
quantitative PCR (qPCR) (Supplementary Fig. 1a, b). Irf4 messenger 
RNA was increased in thymic and peripheral Foxp3* Treg Cells in 
comparison to CD25” Foxp3. CD4~ T cells (data not shown)’. 
Furthermore, Foxp3 knockdown using a retrovirally encoded 
Foxp3-specific short hairpin RNA resulted in a marked diminution 
in Irfé mRNA (Supplementary Fig. lc), suggesting that Foxp3 
directly regulates IRF4 expression in Tyeg cells. 

Next we induced deletion of a conditional Irf4 allele (Irf4" ) in Treg 
cells by crossing Irf#” mice to Foxp3°" mice expressing yellow 


fluorescent protein (YFP)-Cre recombinase fusion protein under 
the Foxp3 locus control!*’. The Irf4’ allele has a ‘built-in’ reporter 
capacity in that Cre-mediated recombination results in the deletion 
of the Irf4 promoter and the exon containing the translational start, 
and the concomitant expression of green fluorescent protein (GFP)"*. 
The specificity of the Irf4" deletion was examined by flow cytometric 
analysis of Foxp3 expression in a sorted GEP* CD4~ T cell popu- 
lation that underwent Cre-mediated recombination, and in a 
GFP CD4* T cell population that did not. Essentially all GFP* cells 
expressed Foxp3, whereas GFP cells lacked Foxp3 expression 
(Fig. 1a). Flow-cytometric analysis of control Irf4”* Foxp3“* mice 
showed that IRF4 expression was markedly increased in all peripheral 
Treg cells, but only modestly in Foxp3* CD4* thymocytes, whereas 
IRF4 levels were undetectable in corresponding Foxp3 cell subsets. 
On Cre-mediated deletion in Irf4” Foxp3“’ mice, IRF4 protein 
became undetectable in both peripheral and thymic Foxp3* cells 
(Fig. 1b). Ifa" Foxp3©” and Irf4”~ Foxp3\* mice harbouring an 
IRF4-deficient T,.. subset were born at the expected Mendelian ratio 
and were indistinguishable from their wild-type or heterozygous 
littermates during the first month of life. However, by 6-8 weeks of 
age Irf4”" Foxp3“" and Irf#”— Foxp3“ mice manifested identical 
autoimmune disease—including lymphadenopathy, weight loss, 
blepharitis and dermatitis—and succumbed to disease at 3-4 months 
of age (Fig. 1c, d and data not shown). Histopathological evaluation 
of diseased mice showed massive infiltration in the pancreas, lung 
and stomach, whereas control littermates did not show any notice- 
able pathology. For comparison to a complete T,.g cell deficiency, we 
analysed tissue lesions in Foxp3?'® knock-in mice (expressing 
human diphtheria receptor, DTR under control of Foxp3 locus) that 
were subjected to chronic ablation of a Tye, cell subset caused by 
diphtheria toxin treatment, starting from birth. These mice showed 
analogous lesions in the pancreas and stomach, and much more 
severe lesions in the lung in comparison to mice harbouring an 
IRF4-deficient T,.g subset (Fig. le and data not shown). In contrast 
to the massive liver lesions observed after T,., depletion, livers in 
mice containing an IRF4-deficient T,., subset were unaffected, but 
kidneys showed the opposite trend (Supplementary Fig. 2 and data 
not shown). Furthermore, flow cytometric analysis showed expan- 
sion and activation of peripheral T cells but not dendritic cells in 
Irf4" Foxp3 mice, in contrast to the increased numbers of acti- 
vated dendritic cells observed in T,-.-deficient mice (Fig. 2a, b and 
data not shown). 

The aforementioned phenotypic differences between mice lacking 
Treg cells and mice containing IRF4-deficient T,., cells could be due 
to a numerical decrease in the T,.g cells in the absence of IRF4. 
However, the IRF4-deficient T,.¢ subset was markedly increased in 
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Figure 1| Ablation of IRF4 in T,.. cells results in autoimmune 
lymphoproliferative disease. a, Foxp3“’-mediated deletion of Irf4 is 
restricted to T,¢g cells. Spleen and lymph node CD4* T cells from 

If" Foxp3" mice were FACS-sorted into GFP* and GFP cells 
containing a recombined and unrecombined Irf#" allele, respectively, and 
stained for Foxp3. The post-sorting purity of GFP” and GFP population 
was greater than 97%. b, Intracellular staining of IRF4 in splenic and thymic 
cpD4* Foxp3 (red) and cpD4* Foxp3~ (blue) cells and splenic 

B220- CD138* (green) plasma cells (positive control) in Irf#”* Foxp3\” 
(top panels) and Irf4" Foxp3“ mice (bottom panels). ¢, Splenomegaly and 
lymphadenopathy in Irf4”" Foxp3“” mice. d, Spleen and lymph node 
cellularity in Irf#”* Foxp3” (filled circles) and Irf4" Foxp3“” (open 
circles) mice. e, Histopathology induced after IRF4 ablation in Teg cells. 
Representative haematoxylin and eosin (H&E)-stained tissue sections from 
8-week-old Irf4*/~ Foxp3“” and Irf4 Foxp3“” mice and diphtheria-toxin- 
treated Foxp3?"® mice. Note pronounced infiltrates in the lung, pancreas 
and stomach of Irf4”~ Foxp3“" mice and severe lesions in tissues from 
diphtheria-toxin-treated Foxp3°"® mice (n = 3-5). Original magnification 
(c, e), X10. 


diseased Irf4”" Foxp3\ mice compared to Irf4”* Foxp3“ litter- 
mate controls, probably in response to T cell activation (Fig. 2c, d). 
IRF4 deficiency did not change Foxp3 protein expression on a per cell 
basis (Fig. 2c), excluding the possibility of down-modulation of 
Foxp3 in Tyeg cells in the absence of IRF4 as a reason for their 
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functional incompetence. These results pointed to impairment in a 
particular aspect of T,., function on IRF4 ablation. 

Indeed, analysis of cytokine production by CD4* T cells from 
Irf4" Foxp3\ mice showed a notable increase in the numbers of 
IRF4-sufficient Foxp3 CD4* T cells producing the Ty2 cytokines 
IL-4 and IL-5 (Fig. 3a, b), whereas IRF4-deficient Foxp3~* Treg Cells, 
unlike their IRF4-sufficient counterparts, were unable to produce 
effector cytokines (Supplementary Fig. 3). The production of IL-2 
and the Ty] cytokine IFN-y by CD4* T cells was largely unchanged, 
but production of the Ty17 cytokine IL-17 was marginally increased 
in Irf4" Foxp3° mice compared to Irf#”* Foxp3“” controls 
(Fig. 3a, b and data not shown). In agreement with these results, 
enzyme-linked immunosorbent assay (ELISA) showed high amounts 
of the Ty2 cytokines IL-5, IL-13 and IL-10 in the supernatants of 
CD3-stimulated splenic Irf4”" Foxp3°* T  cell cultures 
(Supplementary Fig. 4). In contrast, a lack of Ty.g cells in Foxp3~ 
mice led to sharply increased production of IFN-y and IL-4 by CD4* 
T cells (Supplementary Fig. 5). Massive increases in IFN-y and IL-2 
production were also observed after ablation of Ty eg cells in Foxp3?1® 
mice’ (J. M. Lund and A.Y.R., unpublished observations). Thus, T};2 
effector CD4* T cell responses seem to be selectively dysregulated in 
mice containing IRF4-deficient Teg cells. Notably, the augmented 
T,,2 responses appeared in Irf4“" Foxp3“ mice only by 3—4 weeks of 
age, whereas both T}2 and Ty1 cytokine production in 11-day-old 
Irf4!" Foxp3™ and Irf4”* Foxp3“” mice was low (data not shown). 

Because IL-4 is an important cytokine that promotes IgG1 and IgE 
class-switch recombination, we measured levels of serum immuno- 
globulin isotypes in diseased 8-week-old Irf4“" Foxp3“” and litter- 
mate control Irf#”* Foxp3°" mice. As a positive control, we also 
analysed 3—4-week-old T,eg-deficient Foxp3 mice. Consistent with 
an uncontrolled Ty;2 response, IL-4-dependent IgG1 and IgE serum 
concentrations reached very high levels (10-30 mg ml”! and 0.75- 
1.5mgml |, respectively) in Irf4“" Foxp3\’ mice, whereas serum 
IgM and IgA concentrations were only modestly increased in com- 
parison to control mice. However, the amounts of the remaining 
immunoglobulin isotypes, including IFN-y-dependent IgG2a, were 
diminished in the serum of afflicted mice (Fig. 4a). Unlike 
Irf4" Foxp3\ mice, Foxp3 mice showed in addition to increased 
IgG] and IgE, sharply increased amounts of IgG2a consistent with the 
increased IFN-y production in these mice (Fig. 4a). Furthermore, we 
observed ubiquitous germinal centre formation and increased num- 
bers of plasma cells in the spleens of diseased Irf4”" Foxp3“” mice, 
but not in the littermate controls (Fig. 4b and data not shown). In 
Ifa" Foxp3™ mice, 15% and 67% of plasma cells produced IgE and 
IgG1, respectively, in contrast to 3% IgG1-producing and undetect- 
able numbers of IgE-producing plasma cells in control mice 
(Supplementary Fig. 6). Furthermore, examination of diseased mice 
harbouring an IRF4-deficient T,., cell subset revealed a predom- 
inance of cells with a characteristic plasma cell morphology infiltrat- 
ing the pancreas, kidney and stomach (Fig. 4c and data not shown). 
In contrast, the pancreatic and other tissue infiltrates in Foxp3?'® 
mice subjected to chronic T,.g ablation, or in Foxp3 mice, consisted 
predominantly of macrophages, lymphocytes and neutrophils with 
very few plasma cells (Fig. 4c and data not shown). We also tested 
whether the dysregulated T},2 response in the presence of IRF4-defi- 
cient Tyg cells promoted autoantibody production. Indeed, serum 
IgG1 from affected Irf4" Foxp3\’ mice showed robust reactivity 
with several tissue antigens, whereas minimal reactivity was found 
in control animals (Supplementary Fig. 7). 

The distinct immune activation profile and tissue lesions in dis- 
eased mice harbouring an IRF4-deficient T,., cell subset were in 
agreement with the previously mentioned idea that only a part of 
the suppressor program was impaired in IRF4-deficient T, eg cells. In 
further support of this notion, the in vitro suppressor capacity of 
IRF4-deficient Tyg cells isolated from Irf4™" Foxp3Cre mice was lar- 
gely intact (Fig. 5a). Furthermore, cell surface amounts of two putat- 
ive T,.g suppressor effector molecules CTLA4 and CD73 (also known 
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Figure 2 | Increased numbers and activation of CD4* T cells in mice 
harbouring IRF4-deficient T,., cells. a, CD4~ and CD8* T cell numbers in 
the spleen (sp) and lymph nodes (In) of Irf#”" Foxp3°” (open circles) mice 
and Irf4”* Foxp3“” (filled circles) littermate control mice. NS, not 
significant. b, Flow cytometric analysis of CD44, CD62L and CD69 
expression on CD4* T cells in 8-week-old Irf4“" Foxp3“” mice and 
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Irff”* Foxp3“ littermates. A representative of three independent 
experiments is shown. c, d, Increased Foxp3* Treg cell subset in 

Irf4"" Foxp3“ mice. Flow cytometric analyses of spleen and lymph node 
cells from Irf4”* Foxp3™ (filled circles) and Irf4" Foxp3“” (open circles) 
mice. A representative of three independent experiments is shown. 


as NT5E) were not altered in IRF4-deficient T,., cells. The levels of 
CD25 (IL2RA) and GITR (TNFRSF18) were also unchanged (Fig. 5b 
and data not shown). However, ICOS expression was decreased, 
suggesting that IRF4 regulates expression of a distinct subset of func- 
tionally important genes in Tyg cells. 

An unbiased assessment of the effect of IRF4 deficiency on Tyeg 
transcriptome demonstrated that the expression of up to 20% of 
‘Tyeg-Specific’ genes decreased in the absence of IRF4, whereas 
approximately 7% were increased (Fig. 5c). Recent gene targeting 
studies demonstrated that T,., cells use several suppressor modalit- 
ies'”*°, Examination of the IRF4-dependent subset of ‘T,,..’ genes and 
the subsequent independent confirmation by qPCR showed signifi- 
cantly decreased expression of several genes (Fgl2, 1110 and Gzmb) 
encoding putative suppressor effector molecules; a few (Ebi3 and 
Entpd1) were only marginally decreased, whereas others (Tgfb1 and 
Ctla4) did not change (Fig. 5d, e and data not shown). It is likely that 
changes in expression of a combination of genes, but not a single 
gene, account for the impaired suppressor capacity of IRF4-deficient 
Treg Cells. In this regard, we recently showed a role for IL10 in Tyeg- 
mediated suppression”. In addition, ICOS-deficient T,.. cells were 
inferior in comparison to their wild-type counterparts in limiting 


Figure 3 | IRF4 deficiency in T,g cells results in a selective failure to control 
Ty2 responses. a, b, Flow cytometric analysis of cytokine production by 
splenic CD4* T cells from Irf4”* Foxp3©” mice and Irff" Foxp3 
littermates. Splenocytes were stimulated with CD3 (5 j1gml_') and CD28 
(5 ug ml) antibodies in the presence of Golgi-Plug (1 :gml') for 5h 
before staining for CD4, CD8 and the indicated cytokines. A representative 
of three independent experiments is shown. 
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Figure 4 | Increased serum IgG1 and IgE concentration, germinal centre 
formation, and plasma cell tissue infiltration caused by IRF4 deficiency in 
Treg cells. a, Analysis of immunoglobulin isotype amounts in sera of 8-week- 
old Irf#”" Foxp3“” mice and Irf#”* Foxp3™ littermates, and of 3—-4-week- 
old Foxp3” and Foxp3* littermates. b, Immunofluorescent staining of 
germinal centre B cells (GL7‘, green), follicular B cells (IgD*, red), and 
CD4* T cells (blue) in spleens of Irf4" Foxp3“” mice and Irf4”* Foxp3°" 
littermates. Original magnification, X20. ¢, Histological sections of H&E- 
stained pancreas from 8-week-old Irf4"" Foxp3“” mice and Treg-deficient 


expansion of effector T cells in lymphopenic hosts (Y.Z., unpublished 
observations). 

The observation that ICOS—the expression of which is critically 
important for T,,2 differentiation’*—was expressed in T,g cells in an 
IRF4-dependent manner suggested that IRF4 might be involved in 
the regulation of a functionally important transcriptional module 
shared by both T,,., and T42 effector T cells. To test this hypothesis, 
we cross-referenced the IRF4-dependent gene data set derived from 
our Tyeg cell studies to an effector T};2-specific gene data set. This 
comparison and independent qPCR analysis showed that in addition 
to Icos, the expression of two important T}2 genes Mafand Ccr8 was 
compromised in IRF4-deficient T,.g cells, and the expression Il Ir! — 
another essential player in T},2 differentiation—was also decreased 
(Fig. 5f-h)?7?. 

Next, using Icos as a model gene, we sought to investigate the 
possibility that in T,., cells IRF4 and Foxp3 co-operate in transcrip- 
tional regulation. Using CLOVER, a transcription factor binding site 
prediction algorithm’’, we identified a putative IRF4 binding site 
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diphtheria-toxin-treated Foxp3?'® mice. The Irf4”~ Foxp3“” pancreas is 
infiltrated primarily by plasma cells (arrows), that is, distinct round cells 
containing an eccentric nucleus with a cartwheel chromatin appearance and 
perinuclear clearing (inset; original magnification, 60). In contrast, the 
pancreatic infiltrates of diphtheria-toxin-treated Foxp3?™* mice contained 
principally macrophages (arrows), that is, large cells with abundant 
eosinophilic cytoplasm, reniform to oval nuclei, and indistinct cell borders 
(inset; original magnification, < 60). Original magnification for both panels, 
x40. Representative sections are shown. 


within the Icos promoter corresponding to the Foxp3 binding sites* 
(Supplementary Fig. 8), and confirmed binding of both Foxp3 and 
IRF4 by ChIP-coupled qPCR (Fig. 5i). These results raised the pos- 
sibility that Foxp3 physically interacts with IRF4. In fact, this inter- 
action was confirmed by IRF4 and Foxp3 co-immunoprecipitation 
from T,g cell nuclear lysates using a Foxp3-specific antibody (Fig. 5j). 
These results indicate that IRF4 binds to Foxp3 and the resulting 
complexes affect expression of certain target genes such as Icos. 
Our results indicate that Foxp3 induces IRF4 expression in Tyeg 
cells. A transcriptional module downstream of IRF4 is probably 
modified after IRF4 interaction with Foxp3 to facilitate efficient 
Ty2 suppression by Tye, cells. Uniformly increased IRF4 protein 
expression in peripheral T,., cells seems to suggest that they are 
equally poised to suppress T}42 responses. However, Tyg cell popula- 
tions may still be heterogeneous in this regard, if signalling- 
dependent post-translational modifications of IRF4 or recruitment 
of other nuclear factors are needed for T,..-mediated T};2 suppres- 
sion. We propose that T,,.¢ cells might hijack certain components of 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 458|19 March 2009 


a 
20,000 


i \ fy 
= 15.000 60 i | 60 I \\ I f\ 
i onl ! 

fh | Ir \| 

5 40,000 40 ff i a \\ 1 a 

20 20. Vt \ { \ 

5,000 0 0 0 Son 

010? 10? 104 10° 010? 103 104 105 0102 103 104 105 
> > 


oL4—___—_—_—, ~ 
Ty it 12 14 18 Te Coes Cue oe 
Tp:T_ ratio 
c d e 
Exp1 Exp2 
XP Xp: P 
= 
8 1,200 ok 0 
§ 1,000 ats 
2 800 2s 
© 600 23a 
3a aes 
200 222 
E 0 4321 0-1-2-3-4 Se = 
Zz Up Down a wg -8 
Log2 ratio Wb AN oe OS 1 
WP WH Wh gor _® 
oes STV ee ¢ 
h 
Exp1 Exp2 a bal 2 
Mat De 2 ar 
i, os 0 oa 0 
2S 2S8 -2 
I £6 -2 =o 
fi sa Saf 4 
22s-4 BLD 6 
O84 oge 
oe a Qe -8 
x -8 *< 
is % HS 49 
wh we Ww Dd P ww 
ar of Ww 
i J Input IP 
good 
ee me Re) 
x KX KK 
o 90 00 
bib Wb i 
PP eS 
: aagaa 
IP: Foxe3 6 G6 OO 
IRF4 om ee 
WB |Foxp3 = 
Icos Gmpr ps — 


Figure 5 | IRF4 interacts with Foxp3 and diminished expression of a subset 
of suppressor effector and T,,2 specific genes in IRF4-deficient T,.. cells. 
a, IRF4-sufficient (Irfa™ Tr, red) and -deficient (Irf4 Tp, blue) Treg cells 
from 5-week-old Irf4*’* Foxp3°” mice and Irf4" Foxp3“ littermates 
suppress in vitro proliferative response of Foxp3_ CD4™ T cells (T) from B6 
mice. A representative of two independent experiments is shown. b, Flow 
cytometric analysis of CD25, CTLA4 and ICOS expression by IRF4-sufficent 
(red) and -deficient (blue) T,eg cells. ¢c, Numbers of IRF4-independent genes 
that were up- (red bars) or downregulated (green bars), respectively, in Tyeg 
cells compared to naive CD25” Foxp3_ CD4* T cells’. Open bars represent 
genes in which expression was changed by twofold or more in the absence of 
IRF4. d, Decreased expression of genes with a presumed role in Tyeg 
suppressor function in IRF4-deficient compared to IRF4-sufficient T,., cells. 
The data (c, d) represent average of two independent microarray 
experiments (exp) performed using YFP-Cre~ Treg cells FACS-purified from 
healthy Irf4”~ Foxp3° and Irf4*’~ Foxp3“ littermates. e, qPCR 
analysis of relative expression of genes shown in d in Irf4~ Treg (black) and 
Irf4” Treg (red). f, The decreased expression of Ty;2-specific or functionally 
important genes in IRF4-deficient in comparison to IRF4-sufficient T,,¢g cells 
(two independent microarray experiments as above). g, h, qPCR analysis of 
relative expression of the T};2-specific gene set in Irf4~ Treg (black) and 
Irf4” Treg (red) (g), and in in vitro differentiated Ty1 (black) and Ty2 cells 
(red) (h). Data in e, g and h represent mean and s.d. of the expression of 
genes relative to Hprtl in two independent experiments using three 
replicates each. i, Both IRF4 and Foxp3 bind to the promoter region of the 
Icos gene. qPCR analysis of Foxp3- and IRF4-bound chromatin isolated from 
wild-type T,<g cells using primer set corresponding to the Icos promoter 
region. IgG ChIP and qPCR using primers corresponding to the promoter 
region of Gmpr was used as a specificity controls. j, Western blot (WB) 
analysis of IRF4 in nuclear lysates of wild-type Foxp3~ Treg Cells and total 
Foxp3— CD4* T cells (control) (lanes 1 and 2), and in Foxp3 complexes 
immunoprecipitated (IP) from the nuclear lysates using Foxp3 antibody. 
Transcription factor p65 was a negative control. IRF4 signal in control 
nuclear lysates is due to the presence of activated IRF4* CD25* Foxp3” T 
cells. 
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transcriptional machinery promoting a particular effector T cell dif- 
ferentiation to efficiently control the corresponding type of the 
immune response. 


METHODS SUMMARY 
Mice. Mice containing Irf#", Irf4”, Foxp3“" and Foxp3”"® alleles were previously 
described”!*"°, The Irfé" Foxp3\ and Irfa”~ Foxp3“" mice were indistin- 
guishable in regards to the IRF4 deletion efficiency in T,.g cells and to the 
immunological and clinical manifestations of autoimmunity, and so were all 
three types of healthy littermate control mice, Irf4*!* Foxp3“, Inf” * Foxp3* 
and Irf4*!~ Foxp3. 
Flow cytometric and serum immunoglobulin analyses. Rat anti-mouse IRF4 
monoclonal antibody (clone 3E4) was raised against a glutathione S-transferase 
(GST)-fusion protein containing the carboxy-terminal 65-amino-acid sequence 
of murine IRF4. FACS data were acquired on a FACSCanto flow cytometer 
(Becton Dickinson) and analysed using FlowJo software package (Tri-Star). 
Serum IgM, IgG1, IgG2a, IgG2b, IgG3 and IgA concentrations were measured 
using SBA Clonotyping System (Southern Biotech). IgE ELISA was performed 
using biotinylated anti-IgE antibody (BD Pharmingen) and streptavidin-con- 
jugated HRP. 
ChIP and qPCR. ChIP with Foxp3 and IRF4 antibodies (Santa Cruz 
Biotechnology) were performed using MACS-purified CD4* CD25* Treg Cells 
as previously described*. qPCRs were performed using primers listed in 
Supplementary Table 1. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. C57BL/6 (B6) mice were purchased from the Jackson Laboratory. Mice 
containing Inf’, Irf4, Foxp3\ and Foxp3?"® alleles were previously 
described”"*. The Irff"Foxp3© mice were indistinguishable from 
fa’ ~ Foxp3“ mice in regards to the efficiency of IRF4 deletion in Treg Cells, 
and to the immunological and clinical manifestations of autoimmunity, as were 
all three types of healthy littermate control mice, Irf4*/* Foxp3“, 
Irf4’* Foxp3“* and Irf4*!~ Foxp3“*. Mice were housed under specific patho- 
gen-free conditions and used according to the guidelines of the Institutional 
Animal Care Committee at the University of Washington. 

Antibodies and FACS analysis. Fluorescent-dye-conjugated antibodies were 
purchased from BD Pharmingen and eBioscience. Rat anti-mouse IRF4 mono- 
clonal antibody (clone 3E4) was raised against a GST-fusion protein containing 
the C-terminal 65-amino-acid sequence of murine IRF4. FACS data were 
acquired on a FACSCanto flow cytometer (Becton Dickinson) and analysed 
using FlowJo software package (Tri-Star). Intracellular staining of Foxp3 and 
IRF4 was conducted using Foxp3 Mouse Regulatory T cell Staining Kit 
(eBioscience). For flow cytometric analysis of cytokine- and immunoglobulin- 
secreting cells, cell populations were first stained with antibodies against the 
indicated cell surface markers, followed by permeabilization in Fix/Perm buffer, 
and intracellular staining in Perm/Wash buffer (BD Pharmingen). 
Immunofluorescence microscopy. Frozen tissue sections were fixed in cold 
acetone for 20 min, washed twice and pre-blocked in PBS containing 5% normal 
rat serum, 5% normal rabbit serum and 5% BSA for 30 min. Slides were then 
incubated in primary antibodies (anti-GL7-FITC, anti-IgD-biotin, anti-CD4- 
AF647) for 45 min, washed twice, and incubated with secondary antibodies 
(anti-FITC-AF488, StrepAv-AF555) for 45 min and fluorescence was examined 
using a Leica SL confocal microscope. 

In vitro suppression assay. For in vitro suppression assays, Tye, cells and 
‘effector’ T cells were purified by positive selection using CD4-specific MACS 
beads (Miltenyi), followed by sorting on a FACSAria cell sorter (Becton 
Dickinson). Antigen-presenting cells were prepared from wild-type B6 spleno- 
cytes by T-cell depletion using Thyl-specific MACS beads. Effector T cells 
(2 X 10* cells well!) were co-cultured with Treg Cells at indicated ratios in the 
presence of irradiated (30 Gy) antigen-presenting cells (1 X 10° cells well”') in 
96-well plates in complete RPMI1640 medium supplemented with 10% FBS and 
CD3 antibody (1 pg ml~') for 60h. One microcurie *H-thymidine was added 
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into the cultures for a further 8-12 h and cell proliferation in triplicate cultures 
was measured using a scintillation counter. 

Serum immunoglobulin ELISA. Serum IgM, IgG1, IgG2a, IgG2b, IgG3 and IgA 
concentrations were measured using SBA Clonotyping System (Southern 
Biotech). For IgE, ELISA was performed using biotinylated anti-IgE detection 
antibody (BD Pharmingen) and streptavidin-conjugated HRP. 

Affymetrix microarray and qRT-PCR. Total RNA was extracted with RNA 
Stat-60 reagent (Iso-Tex Diagnostics) from IRF4-sufficient and -deficient 
CD4* GEP/YFP-Cre* T,., cells FACS-purified from healthy Irf4*/~ Foxp3“” 
and Irf4”~ Foxp3“""™"' mice, respectively. Complementary DNA was synthesized 
using Superscript II Reverse Transcriptase (Invitrogen), and amplified twice and 
labelled using MessageAmp II aRNA Amplification kit (Ambion). Biotinylated 
complementary RNA was fragmented and hybridized to Affymetrix GeneChip 
Mouse Genome 430 2.0 arrays at the Stanford PAN Facility. All data analyses 
were performed by using Bioconductor for the statistical software R (http:// 
www.!-project.org). Expression values were background corrected, normalized 
and summarized by using the default settings of the gcrma package. From 
the resulting data sets we extracted a list of genes with a more than twofold 
change in expression and cross-referenced this list to the previously published 
set of T,.g-specific genes’. Effector T};2-specific gene data set was extracted 
by subtracting gene expression values for CD4 T cells activated in vitro under 
Ti] skewing conditions or without skewing (T};0) from those for T cells acti- 
vated under T}2 skewing conditions (K. Hilgner and K. Murphy, unpublished 
observations). 

To verify expression array data, independently prepared T,g cell samples were 
used to generate cDNA. qPCR was performed using Power SYBR Green PCR 
master mix (Applied Biosystems). PCR primer sequences are listed in 
Supplementary Table 1. 

Chromatin immunoprecipitation and qPCR. Foxp3 and IRF4 antibody (Santa 
Cruz Biotechnology) ChIPs were performed using MACS purified CD4* CD25* 
Treg cells as previously described*. GPCRs were performed using primers listed in 
Supplementary Table 1. 

Immunoprecipitation and western blot analysis. Nuclear extracts from sorted 
CD4* Foxp3* and CD4* Foxp3~ cells were prepared in nuclear lysis buffer 
(Active Motif) according to the manufacturer's protocol. 
Immunoprecipitation was carried out using anti-Foxp3-coated magnetic beads 
followed by western blotting with Foxp3, IRF4 or p65 antibodies. 
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Defensin-like polypeptide LUREs are pollen tube 
attractants secreted from synergid cells 


Satohiro Okuda'*, Hiroki Tsutsui'*, Keiko Shiina’, Stefanie Sprunck’, Hidenori Takeuchi’, Ryoko Yui’, 

Ryushiro D. Kasahara’, Yuki Hamamura', Akane Mizukami’, Daichi Susaki', Nao Kawano’, Takashi Sakakibara’, 
Shoko Namiki’, Kie Itoh’, Kurataka Otsuka*, Motomichi Matsuzaki*, Hisayoshi Nozaki’, Tsuneyoshi Kuroiwa”, 
Akihiko Nakano”®, Masahiro M. Kanaoka', Thomas Dresselhaus”, Narie Sasaki'* & Tetsuya Higashiyama‘* 


For more than 140 years, pollen tube guidance in flowering plants 
has been thought to be mediated by chemoattractants derived from 
target ovules'. However, there has been no convincing evidence of 
any particular molecule being the true attractant that actually 
controls the navigation of pollen tubes towards ovules. Emerging 
data indicate that two synergid cells on the side of the egg cell emit a 
diffusible, species-specific signal to attract the pollen tube at the 
last step of pollen tube guidance’’. Here we report that secreted, 
cysteine-rich polypeptides (CRPs) in a subgroup of defensin-like 
proteins are attractants derived from the synergid cells. We isolated 
synergid cells of Torenia fournieri, a unique plant with a protruding 
embryo sac, to identify transcripts encoding secreted proteins as 
candidate molecules for the chemoattractant(s). We found two 
CRPs, abundantly and predominantly expressed in the synergid 
cell, which are secreted to the surface of the egg apparatus. 
Moreover, they showed activity in vitro to attract competent pollen 
tubes of their own species and were named as LUREs. Injection of 
morpholino antisense oligomers against the LUREs impaired pol- 
len tube attraction, supporting the finding that LUREs are the 
attractants derived from the synergid cells of T. fournieri. 

Because the attractants derived from the synergid cells show rapid 
molecular evolution’, we decided to investigate peptides/proteins 
expressed in the synergid cells. We first developed a protocol to 
isolate synergid cells manually. The protruding embryo sacs of 
T. fournieri are especially suitable to isolate synergid cells as proto- 
plasts (Fig. 1a)°. The synergid cell can be readily distinguished from 
other cells of the embryo sac (Fig. la and Supplementary Fig. 1a). 
After poly(A) * RNA extraction, reverse transcription and a 30-cycle 
long-distance polymerase chain reaction (PCR), we established a 
complementary DNA library from 25 synergid cells. 

Expressed sequence tag (EST) and cluster analysis from 2,112 
randomly picked clones generated 256 contigs (Supplementary 
Fig. 1b and Supplementary Table 1). Interestingly, we found that 
transcripts encoding CRPs—cysteine-rich small proteins with a 
putative signal peptide’—were most abundant in the cDNA library. 
Among the eight contigs with the highest number of ESTs, seven 
represented genes for CRPs. In total, we identified 16 CRPs with a 
putative signal peptide at the amino-terminal region (Supplementary 
Table 2; 6.3% of 256 contigs). These 16 CRPs represented ~29% of 
the total number of EST sequences. The 16 CRPs were numbered as 
T£CRP1-16 (T. fournieri cysteine-rich polypeptide 1-16) according 
to the size of the contigs. Because genes for CRPs are known to 


diverge from each other rapidly’, it was difficult to find orthologues 
in other plants by molecular phylogenetic analysis. 

The expression of many kinds of CRPs from synergid cells has also 
been reported as a characteristic feature in Arabidopsis’* and 
maize”’®. Genome-wide analysis in Arabidopsis ovules has indicated 
that 56 CRPs were downregulated in a T-DNA insertion mutant of 
MYB98 (ref. 8), which encodes an R2R3-MYB transcription factor 
specifically expressed in the synergid cell”''. MYB98 is necessary for 
both normal development of finger-like projections of the synergid 
cell wall (filiform apparatus) and pollen tube guidance'’. Some of the 
CRPs predominantly expressed in the synergid cells have been shown 
to be secreted towards the filiform apparatus using green fluorescent 
protein (GFP) fusion’. 

We focused our investigation on the three largest CRP contigs 
(TfCRP1-3; Fig. 1b), which contained more than 200 ESTs 
(Supplementary Fig. 1c). Abundant expression of these TfCRPs was 
confirmed using quantitative reverse transcription—PCR (qRT-PCR; 
Supplementary Fig. 1d). They were related to defensin-like proteins 
(DEFLs)"’, as they contained CS" and y-core’* motifs common to 
defensins and all classes of cysteine-containing antimicrobial peptides, 
respectively (Supplementary Table 2). Although CRPs are highly 
polymorphic’®, putative mature TfCRP1 and TfCRP3, both of which 
contain 6 cysteines, are closely related according to the alignment of 
cysteine residues (Fig. 1b), their overall sequence similarity (62.1%), 
their acidic pK, (4.7 and 5.7) and the total number of amino acids (62 
and 70; Supplementary Table 2). In contrast, TfCRP2 contains 
8 cysteines and consists of a smaller number of amino acids (48). 

The three CRPs of T. fournieri were specifically expressed in floral 
organs, as investigated by RT-PCR (Fig. lc). TfCRP1 and TfCRP3 
were expressed specifically in the mature ovary, predominantly in the 
synergid cell. TfCRP2 was additionally expressed in the anther and 
developing fruit, but predominantly in the synergid cell within the 
mature ovary. 

We performed immunological analyses using antibodies against 
TfCRP1 and TfCRP3 (Fig. 1d, e). Consistent with the RT-PCR data 
analysis, TfCRP1 and TfCRP3 peptides were detected in the mature 
ovary of unpollinated pistils by immunoblotting analysis (Fig. 1d). 
Apparent molecular masses were 8.6kD and 9.8kD, respectively, 
which were similar to those of recombinant proteins—predicted 
mature polypeptide lacking N-terminal signal sequences (Fig. 1d; 
SignalP 3.0; Supplementary Table 2). This implied that the polypep- 
tides were cleaved as predicted and were secreted. Consistently, these 
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Figure 1 | Identification and expression analysis of major CRPs of the 
synergid cell. a, Ovules of T. fournieri were treated with a cell-wall- 
degrading enzyme solution to isolate the protruding embryo sac cells 
(arrowheads) as protoplasts (left three panels). An arrow indicates the 
micropylar end of the embryo sac to which a pollen tube is attracted. After 
collecting protoplasts of the female gametophytic cells (FGs), protoplasts of 
egg cells (ECs), synergid cells (SYs) and central cells (CCs) were further 
separated manually. Scale bars, 20 um. b, Alignment of three major CRPs 
identified by the EST-cluster analysis of the synergid cell. Only predicted 
mature CRPs are shown. Cysteine residues are labelled in pink, the 
conserved glycine residue of the defensin y-core motif in orange, and other 
residues conserved in at least two of the three sequences in blue. ¢c, RT-PCR 
analysis of organs/tissues of whole plants (left) and tissues/cells of ovaries 
(right). RT, root; ST, stem; LF, leaf; SEP, sepal; PET, petal; AT, anther; STY, 
style with a stigma; OVR, ovary; i-OVR, immature ovary with non- 
cellularized embryo sacs; FR, developing fruit containing heart-stage 
embryos; OW, ovary wall; PL, placenta; 2n-OV, sporophytic cells of ovules; 


two CRPs were detected at the micropylar surface of the filiform 
apparatus of the synergid cell by immunostaining of the ovule 
(Fig. le; Supplementary Fig. le). Properties of TfCRP1 and TfCRP3— 
namely, abundant and predominant expression in the synergid cell 
and small and secreted proteins, as well as rapid molecular evolution 
generally observed in CRPs°—all suggested that they might be pollen 
tube attractants derived from the synergid cell. 

Next we investigated whether recombinant CRPs have the ability 
to attract pollen tubes (Fig. 2). Carboxy-terminal regions of TfCRP1 
and TfCRP3 following the predicted cleavage site were expressed in 
Escherichia coli. Because TfCRP2 could not be expressed in E. coli, it 
was chemically synthesized. Polypeptides were refolded, implying 
that disulphide bonds were appropriately formed between cysteine 
residues. First, a pollen tube attraction assay was established using 
micropipettes (Fig. 2a and Supplementary Movie 1). When TfCRP3 
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GAPDH, glyceraldehyde-3-phosphate dehydrogenase. d, Immunoblotting 
analysis of T. fournieri ovary and recombinant proteins using the specific 
antibodies indicated. Recombinant TfCRP1 and TfCRP3 of predicted 
mature polypeptides lacking N-terminal signal sequences were used. The 
apparent molecular weights of TfCRP1 and TfCRP3 were 8.6 kD and 9.8 kD, 
respectively, in both analysis. Bars indicate position of molecular weight 
markers: 10.9, 16.2, 19.3 and 28.0 kD from the bottom. e, Schematic of the T. 
fournieri ovule (OV) containing a protruding embryo sac, and 
immunostaining showing CRP peptides secreted to the surface of the 
micropylar end of the embryo sac (green Alexa Fluor fluorescence, arrows). 
Alexa plus bright field images of ovules better illustrate the location of CRPs 
relative to the ovule structure. As indicated by the schematic model, the 
micropylar end of the embryo sac is occupied by a filiform apparatus (FA) 
formed by two synergid cells (green). The egg cell (yellow) is partly hidden 
behind synergid cells. CLSM, confocal laser scanning microscopy. Scale bars, 
20 um. 


was injected in front of a pollen tube, we found that 60% (n = 20) of 
the tubes turned sharply towards the pipette; this never occurred 
when buffer alone was injected (0%, n= 20). To quantitatively 
investigate the activity of the three peptides for pollen tube attraction, 
we developed a bead method, whereby peptides were embedded in 
gelatin beads and placed in front of the pollen tube (~50 1m in 
distance) by micromanipulation (Fig. 2b, c and Supplementary 
Fig. 2). The beads, ~40 1m in diameter, gradually melted on the 
medium. We found that both TfCRP1 and TfCRP3 showed signifi- 
cant activity to attract pollen tubes in a concentration-dependent 
manner (Fig. 2d, e). At the optimum concentration, 52.5 + 3.4% 
(mean + s.d.;5 n= 34; 40nM) and 65.7+ 7.4% (n= 44; 4nM) of 
pollen tubes were attracted to TfCRP1 and TfCRP3, respectively. 
When we used the protruding embryo sac of T. fournieri instead of 
the beads, 67.6% of the pollen tubes were attracted to the embryo sac 
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Figure 2 | In vitro pollen tube attraction assay using recombinant CRPs. 
a, TfCRP3 mixed with 10 kD Alexa Fluor (asterisk) was injected in front of a 
pollen tube through a micropipette at time points of 00:16 (mm:ss) and 
10:20. Spectral colours correspond to the intensity of fluorescence 
(concentration of the Alexa Fluor dye), with white representing the highest 
level (see colour scales). Scale bar, 20 tum. b, ¢, Arrowheads mark the position 
of the tip of pollen tubes when gelatin beads (asterisks) containing TfCRP1 
(b) and TfCRP3 (c) were placed (00:00). Double arrowheads indicate the tips 
of the tubes growing towards the beads. Green fluorescence in the assaying of 


(n= 34). Surprisingly, even a very low amount of TfCRP1 and 
TfCRP3 (40 pM) already showed activity, which means that only 
~10° molecules in a bead are sufficient to attract pollen tubes. In 
contrast, TfCRP2 (>2.711M) did not show attraction activity 
(7.8 + 1.2%; n = 39), at least in our conditions using the chemically 
synthesized peptide. 

In the in vitro Torenia system, pollen tubes germinated in vitro are 
not competent to respond to the attraction signal from the synergid 
cells’®. Pollen tubes are stimulated by unknown sporophytic female 
factors to acquire their competence’. Moreover, pollen tubes of a 
closely related species, Lindernia micrantha, do not respond to the 
attraction signal derived from the synergid cells of T. fournieri’. 
Consistently, TfCRP1 and TfCRP3 did not show any activity on 
pollen tubes germinated on medium or on tubes of L. micrantha 
(Fig. 2d, e). The latter result was also consistent with the specificity 
of antibodies against TfCRP1 and TfCRP3, which did not label the 
filiform apparatus of L. micrantha (Supplementary Fig. le). Thus, the 
activity of TfCRP1 and TfCRP3 was similar to that of the attraction 
signal derived from the synergid cells of T. fournieri. We also con- 
firmed that SCR (also known as SP11) of Brassica, one of the DEFLs 
(8 cysteines) that is a pollen determinant of self-incompatibility’’, 
showed no activity in the beads assay (1.7 + 3.7%; n= 57; 40nM of 
SP11 (S8-allele)). The empty vector control also showed no activity 
(5.4 + 4.9%; n = 35). Hereafter, we named the two CRPs showing the 
ability to attract the pollen tube as LUREs (TfCRP1 as LURE] and 
TfCRP3 as LURE2). LURE] and LURE2 were heat-labile (Fig. 2d, e). 
We confirmed using LURE2 that the refolding step was important for 
its activity (20.6+ 12.5% attraction without refolding; n= 39; 
4nM). 

To support further the finding that LUREs are involved in pollen 
tube attraction by the synergid cell, we tried to downregulate LUREs 
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TfCRP1 shows the remaining concentration gradient of 10 kD Alexa Fluor at 
20 min after placement of the bead. Scale bar, 20 um. d, e, Pollen tube (PT) 
attraction activity of TfCRP1 (d) and TfCRP3 (e), which depends not only 
on the concentration of peptides but also on the competency, species origin 
of the pollen tube and preparation of polypeptides (heat treatment, 95 °C for 
5 min). The purified and refolded polypeptides were directly used without 
further fractionation, implying that both active and non-active forms of 
polypeptides would contribute to the concentration. Data are means and 
standard deviation (n > 3 with more than 10 pollen tubes per replicate). 


by injecting morpholino (MO) antisense oligomers (oligos)'” into 
the protruding embryo sac cells of T. fournieri (Fig. 3). Efficient 
microinjection into plant cells with an intact cell wall and high 
turgor pressure was achieved by developing a_laser-assisted 
thermal-expansion microinjector. We injected fluorescein isothio- 
cyanate (FITC)-labelled MO antisense oligos into the central cell, 
which is the easiest cell to access. FITC fluorescence spread into the 
cytosol and nuclei of every gametophytic cell, including the two 
synergid cells (Fig. 3a), possibly through their plasmodesmata’. 

At the start of cultivation, we injected a mixture of MO antisense 
oligos against the translation initiation site, the splice donor site and 
the splice acceptor site. Because chemically synthesized TfCRP2 did 
not show pollen tube attraction, we tested antisense oligos for LURE1 
(TfCRP1) and LURE2 (TfCRP3). At 16 to 18h after the start of 
cultivation, we measured whether pollen tube attraction occurred 
in each injected embryo sac (Fig. 3). We found that antisense oligos 
for LURE] and LURE2 specifically decreased the frequency of attrac- 
tion, compared to oligos with inverted sequences (P= 0.0083 for 
LURE] and P= 0.0113 for LURE2; Fisher’s exact test). These results 
further indicated that LURE and LURE2 are involved in pollen tube 
attraction by the synergid cells. 

After a long search for attractants’, it is now widely accepted that 
pollen tubes receive multiple, stage-specific guidance cues before 
arriving at the range of attraction governed by the synergid cell (a 
few hundred micrometers at most). Although it has to be elucidated 
whether other chemoattractants are actually involved in such guidance 
cues, various candidates of attractants have been reported, including a 
lily small cell wall protein chemocyanin”’ and water gradient in lipids”®, 
as factors for directional growth of pollen tubes at the stigma. GABA 
(y-aminobutyric acid) has been suggested as sporophytic signal for 
ovule targeting*'. A candidate for the attractant derived from the 
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Figure 3 | Microinjection of MO antisense oligos for LUREs. a, MO 
antisense oligos labelled with FITC (green fluorescence) were injected into 
the protruding embryo sac of T. fournieri at the start of cultivation (2h). An 
inset in the top left panel shows an injected ovule. At 16h after the start of 
cultivation, ovules without (w/o) injections showed normal pollen tube 
attraction (arrow) to the protruding embryo sac (arrowhead), whereas many 
of the injected ovules showed inhibition of pollen tube attraction. Asterisk 
indicates the cut end of the pollinated pistil. Scale bar in the top left panel is 
200 jim, and in other panels is 20 jum. b, Injection of antisense oligos (anti.) 
showed a reduced frequency of attraction compared to injection of inverted 
oligos (inv.). The asterisk shows significant differences by Fisher’s exact test 
(P < 0.02). Data are frequencies for the total number of embryo sacs 
examined (1) in more than five replicates. 


synergid cell has been reported in maize; Zea mays EGG APPARATUS- 1 
(ZmEAI, Q5G8Z3), a small membrane protein predominantly 
expressed in the synergid cell, is likely to spread to the micropylar 
nucellar cell walls of the ovule and is necessary for pollen tube 
guidance”. Unlike LUREs, these candidate molecules do not belong 
to CRP subfamilies. 

In contrast, flowering plants use various CRPs at different steps of the 
pollen-pistil interaction (Supplementary Fig. 3). SCR/SP11, for 
example, belongs to another subgroup of DEFLs containing eight 
cysteines and acts as the male determinant of the self-incompatibility 
reaction in the Brassicaceae'®. Stigma/stylar cysteine-rich adhesin 
(SCA) belongs to lipid transfer proteins and is involved in adhesion 
of the pollen tube to the extracellular matrix of female tissues”. 
Interestingly, SCA enhanced the activity of chemocyanin synergisti- 
cally’’, whereas LURE] and LURE2 could work solely but not synergis- 
tically, as examined by mixing them together (50.0 + 6.3% attraction; 
n= 48; 400 pM LURE] and 40 pM LURE2). Other subclasses of DEFLs 
such as a pollen coat protein PCP-Al (CAA06465; ref. 24) and Z. mays 
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EMBRYO SACI (ZmES1, AAK08132)” might be involved in intercel- 
lular signalling of gametophytic cells. Most of these CRP subclasses were 
detected in our EST analyses (Supplementary Fig. 3). 

It is known that the synergid cell is a key player controlling various 
fertilization mechanisms*>**. A number of synergid-cell-specific or 
predominant genes have been found recently’, including MYB98, 
ZmEAI and FERONIA, a receptor-like kinase required for pollen 
tube growth termination”. Our results now indicate that LUREs, 
members of the CRP super gene family, are the pollen tube attrac- 
tants derived from the synergid cell of T. fournieri. LUREs showed 
abundant expression, which is not the usual case for a signalling 
molecule; this might be important to keep a gradient of the concen- 
tration. The synergid cell of flowering plants, which locates at the 
entrance of the female gametophyte, expresses many more kinds of 
CRPs, which may have evolved originally from components of the 
plant natural immunity system. It will be exciting to find out the role 
of the other CRPs, which might include factors for defence, pollen 
tube attraction and other fertilization mechanisms. 


METHODS SUMMARY 

Plant materials. Torenia fournieri cv. ‘blue and white’ and Lindernia micrantha 
were used’. 

EST and expression analysis. Ovules of T. fournieri were treated with an enzyme 
solution to detach embryo sacs as protoplasts. Synergid cells were collected by 
using a micropipette under an inverted microscope. After amplification by 
PCR”, cDNAs were cloned and sequenced to generate >4,000 ESTs. EST contigs 
were assembled by Phred/Phrap software and were manually checked to deter- 
mine whether they were CRPs° and contained a putative signal peptide (SignalP 
3.0). Expression of CRP genes was confirmed by RT- and qRT-PCR. 
Immunological analysis. Proteins of mature ovaries were purified using chlo- 
roform and methanol. Pre-immune and anti-LURE affinity purified antibodies 
(1:1,000 dilution) and peroxidase-conjugated anti-rabbit IgG (1:5,000 dilution; 
KPL) were used. For immunostaining, purified antibodies (1:1,000 dilution) and 
Alexa Fluor 488 anti-rabbit IgG (Invitrogen; 1:1,000 dilution) were used. 
Purification of peptides and in vitro attraction assays. Recombinant LUREs 
were expressed in E. coli and purified by HPLC. TfCRP2 was chemically synthesized 
(Operon). After dialysis and refolding, each peptide was injected in front of pollen 
tubes. For bead assays, peptides were embedded in gelatin beads with Alexa Fluor 
dyes conjugated with 10 kD dextran (Invitrogen). Each bead was picked with a glass 
needle and placed in front of pollen tubes. 

Microinjection of MO oligos. A mixture of MO antisense oligos 
(Supplementary Table 3; Gene Tools LLC) conjugated with FITC was used to 
inhibit translation and splicing. We developed a laser injector, which has been 
commercialized along with consumable supplies by Nepa Gene. A glass needle 
was loaded from the bottom with MO oligos, a hydrophobic ionic liquid, a laser 
beam absorbent and an ultraviolet curable resin. After the resin was cured, 
sealing the bottom end, the MO oligos were injected into cells by heat-expanding 
the laser absorbent with a laser beam. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Plant materials. We used Torenia fournieri cv. ‘blue and white’ and Lindernia 
micrantha (Vandellia angustifolia)* plants. Pollen tubes growing through a cut, 
pollinated style were cultured with ovules for 14-18 h on solid medium”’. For the 
in vitro assay using gelatin beads, we used pre-conditioned medium instead of 
cultivation with ovules, which was prepared by cultivating ovules on the placentas 
from ten ovaries in 1 ml fresh liquid medium overnight (20h). The culture was 
covered with hydrated silicone oil (KF-96-100CS; Shin-Etsu) before assaying and 
microinjection. 

EST and expression analysis. Each ovary of Torenia was excised in 200 ul of 
enzyme solution (1% cellulase (Worthington), 0.3% macerozyme RS (Yakult), 
0.05% pectolyase Y-23 (Yakult), 5mM calcium nitrate and 0.4M mannitol, 
pH5.8) in a glass-bottom dish. After incubation at 28 °C for 60 min, detached 
gametophytic protoplasts were collected using a micropipette (Narishige) under 
an inverted microscope. After collection of the protoplasts, synergid cells were 
separately collected and washed in 0.4M mannitol solution containing 5mM 
calcium. Synergid cells were transferred to an Eppendorf tube using a micro- 
scope and stored at —80°C. PolyA*-RNA was extracted from 25 synergid cells 
and used for reverse transcription”. The cDNA was amplified by long-distance 
PCR using an optimal number of 30 cycles, as determined by Southern hybridi- 
zation of 5-1] cDNA aliquots amplified by 18 to 33 PCR cycles each, using a DIG- 
labelled probe of Torenia GAPDH. This PCR probe was obtained using Torenia 
cDNA and the primers TfGAPfw and TfGAPrev. The amplified cDNAs were gel- 
purified by using a 200-nucleotide cutoff so that degraded RNAs and primer 
dimers were removed and that mRNAs encoding small peptides were obtained. 
The PCR products were cloned into the pGEM-T Easy vector (Promega) by TA 
cloning. Plasmids from randomly picked clones were sequenced from both 
directions (Agowa). EST sequences were assembled into contigs using Phred/ 
Phrap software with masking of adenine stretches of 30 or more nucleotides. All 
256 contigs were manually checked to determine whether they were CRPs° and 
had a putative signal peptide (SignalP 3.0). RT-PCR analysis for each ovular 
tissue or cell was performed using samples collected under the inverted micro- 
scope. For qRT-PCR, gene-specific TaqMan probes were used (Supplementary 
Table 3). 

Immunological analysis. Antigen LUREs with or without refolding were 
prepared as described below and used to immunize rabbits. Immunoglobulin 
Gs of the pre-immune serum and anti-LUREs were purified by affinity chromato- 
graphy (HiTrap Protein G HP, GE Healthcare). For immunoblotting analysis, 
mature ovaries of T. fournieri (~100 mg) were collected and frozen in liquid 
nitrogen. The ovaries were homogenized in buffer (50mM Tris-HCl, pH7.5, 
50 mM NaCl, 2.5 mM EDTA, 5 mM DTT (or 20 mM TBP) and a protease inhibitor 
cocktail (Roche)). Proteins were purified by adding chloroform, methanol and 
distilled water (DW) (sample:chloroform:methanol:DW was 1:1:4:3 v:v:v:v). After 
centrifugation, precipitated proteins were washed with methanol and dried. After 
electrophoresis using 15% or 16.5% SDS polyacrylamide gels and Prestained XL- 
Ladder Broad marker (Low; Promega), peptides were blotted onto PVDF mem- 
branes (Immobilon-P or -PSQ, Millipore). Purified pre-immune and anti- LURE 
antibodies (1:1,000 dilution) and peroxidase-conjugated anti-rabbit goat IgG 
(1:5,000 dilution; KPL) were used. For immunostaining, ovules on the placenta 
were fixed using 4% paraformaldehyde in PBS for 40 min. After treatment with 
0.2% Triton X-100 for 5 min and blocking with 3% BSA for 1h, samples were 
treated for 2h with specific antibodies (1:1,000 dilution). The samples were then 
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treated for 2 h with Alexa-Fluor-488-conjugated anti-rabbit goat IgG (Invitrogen; 
1:1,000 dilution). After excision from the placenta, stained ovules were observed 
under an epi-fluorescence microscope or a CLSM”. 

Purification of peptides and in vitro attraction assays. Coding sequences of 
LUREs lacking N-terminal signal sequences (Supplementary Table 2) were amplified 
from the cDNA of T. fournieri ovaries and cloned into the vector pGEX-6p-1 (GE 
Healthcare) or pET-28a(+) (Novagen) to fuse glutathione-S-transferase (GST) tag 
or poly-histidine (His) tag to the N-terminal, respectively. Peptides were expressed 
in E. coli (BL21-CodonPlus; Stratagene) at 37°C overnight and purified by a 
glutathione sepharose column 4B (Amersham) or a Ni column (His-Trap FF). 
The GST tag was removed by a Precision protease (Amersham; five linker amino 
acids remained). These peptides without a tag were used as recombinant proteins 
unless otherwise noted. Predicted mature TfCRP2 was chemically synthesized 
(Operon). The peptides were dialysed (Spectra/Por3 MWCO:3,500 or Spectra/ 
Por6é MWCO:1,000; Spectrum Laboratories) and refolded for 3 days at 4°C using 
glutathione (reduced and oxidized forms; Wako) and L-arginine ethyl ester dihy- 
drochloride (Sigma). For in vitro assays using micropipettes, each peptide in buffer 
(50 mM Tris-HCl, pH 8.0) was loaded into a glass needle as described below. The 
microinjector was positioned using a micro-manipulator (Narishige) attached to an 
inverted fluorescence microscope. Each peptide was injected in front of the pollen 
tubes. Behaviour of pollen tubes was photographed by a CCD camera or by a time- 
lapse digital video system (Sigma Koki). For in vitro assays using gelatin beads, 1 tl of 
peptides with His tag was mixed with 2 ll of 10% (w/v) gelatin (Nacalai) pre-melted 
at 50 °C and 1 pl of 1 mM Alexa Fluor 488, 546 or 568 conjugated with 10 kD dextran 
(Invitrogen). After adding 150 ul of hydrated silicone oil, they were mixed by 
vortexing and cooled on ice for formation of gelatin beads. Each bead (approxi- 
mately 40 jtm in diameter) was picked using a glass needle and placed in front of the 
pollen tubes. The criteria for judgment of ‘attracted’ and ‘non-attracted’ pollen tubes 
are summarized in Supplementary Fig. 2. 

Microinjection of MO oligos. Three kinds of MO antisense oligos conjugated 
with FITC (Gene Tools LLC) were designed for inhibition of the translation 
initiation site and the splice donor and acceptor sites of each LURE. MO oligos 
with inverted sequences were also designed as negative controls. After dissolu- 
tion in distilled water at a concentration of 1mM, each 1.5 pl of the three 
antisense MO oligos, as well as the three inverted MO oligos, were mixed (each 
0.33 mM) and centrifuged at 30,000g for 20 min to precipitate impurities. We 
developed and commercialized a method of laser-assisted microinjection (Nepa 
Gene). A glass needle, with an outer diameter at the tip of 0.2—0.3 jtm, was made 
by pulling glass capillaries (GDC1; Narishige) with P-97 (Sutter) or MCF-100 
(Nepa Gene) micropipette pullers. The needle was loaded with MO oligos, a 
hydrophobic ionic liquid, a laser beam absorbent and an ultraviolet curable resin 
through the bottom. After the resin was cured by irradiation with ultraviolet light 
to seal the bottom end, the needle was placed in the holder of the laser injector 
(LTM-1000; Nepa Gene). MO oligos were injected into cells by heating and 
expanding the laser absorbent by use of a laser beam. The amount of injected 
oligonucleotides was adjusted to ~1% (v/v) of the volume of the central cell by 
observing the fluorescence of FITC with a CCD camera. Micro-injection was 
performed during the first 3 h after the start of cultivation. 
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Nucleosome organization is critical for gene regulation’. In living 
cells this organization is determined by multiple factors, including 
the action of chromatin remodellers’, competition with site- 
specific DNA-binding proteins’, and the DNA sequence prefer- 
ences of the nucleosomes themselves**. However, it has been dif- 
ficult to estimate the relative importance of each of these 
mechanisms in vivo’®"’, because in vivo nucleosome maps reflect 
the combined action of all influencing factors. Here we determine 
the importance of nucleosome DNA sequence preferences experi- 
mentally by measuring the genome-wide occupancy of nucleo- 
somes assembled on purified yeast genomic DNA. The resulting 
map, in which nucleosome occupancy is governed only by the 
intrinsic sequence preferences of nucleosomes, is similar to in vivo 
nucleosome maps generated in three different growth conditions. 
In vitro, nucleosome depletion is evident at many transcription 
factor binding sites and around gene start and end sites, indicating 
that nucleosome depletion at these sites in vivo is partly encoded in 
the genome. We confirm these results with a micrococcal nuclease- 
independent experiment that measures the relative affinity of 
nucleosomes for ~40,000 double-stranded 150-base-pair oligonu- 
cleotides. Using our in vitro data, we devise a computational model 
of nucleosome sequence preferences that is significantly corre- 
lated with in vivo nucleosome occupancy in Caenorhabditis ele- 
gans. Our results indicate that the intrinsic DNA sequence 
preferences of nucleosomes have a central role in determining 
the organization of nucleosomes in vivo. 

We sought to establish the extent to which the DNA sequence 
determines nucleosome organization in living cells. Our strategy, 
previously used by others for two yeast promoters'’, was to compare 
in vivo nucleosome organization with that obtained by an in vitro 
assembly procedure using only purified nucleosomes and purified 
DNA. To obtain a genome-wide map of nucleosome occupancy 
governed solely by nucleosome sequence preferences, we purified 
chicken erythrocyte histone octamers and assembled them on puri- 
fied yeast genomic DNA by salt gradient dialysis'’. We then isolated 
mononucleosomes by micrococcal nuclease digestion, and used 
parallel sequencing to determine nucleosome positions. We 
performed two independent experiments, resulting in ~ 10,000,000 
DNA sequence reads that map uniquely to the yeast genome. For 
comparison to in vivo nucleosome positions, we isolated mononu- 
cleosomes from living cells®”*"°, and obtained ~25,000,000 sequence 
reads from 6 independent experiments. For each map, we deter- 
mined the average nucleosome occupancy at every base pair, calcu- 
lated as the log-ratio between the number of reads that cover that base 


pair and the genome-wide average coverage per base pair (see 
Methods). 

The nucleosome organizations of the in vitro and in vivo maps are 
notably similar, although not identical (Fig. 1), with a correlation of 
0.74 between the nucleosome occupancy per base pair (Fig. 2a). On 
the scale of individual nucleosomes, the in vitro data separate regions 
that are enriched in nucleosomes in vivo from regions depleted of 
nucleosomes with high accuracy (Supplementary Fig. 1). Similarly, 
we found a significant correspondence between the positions of 
stable nucleosomes in the two maps (Supplementary Fig. 2). This 
high degree of similarity between the maps indicates that nucleosome 
sequence preferences have a dominant role in determining in vivo 
nucleosome organization. 

The correlation between the maps is not uniform across the gen- 
ome. We found a higher correlation between the maps at non-pro- 
moter intergenic regions located at ends of convergently transcribed 
genes (0.83) and a lower correlation at promoter (0.69) and coding 
(0.69) regions. In addition, the depletion level in vivo relative to that 
measured in vitro at coding regions increases with the expression level 
of the associated genes (Fig. 2b). These results indicate that transcrip- 
tion factors, chromatin regulators and active transcription influence 
the resulting nucleosome organization in vivo. 

Because the nucleosome organization in vitro is determined only 
by the DNA sequence, we asked whether we could derive rules that 
are predictive of nucleosome positioning and occupancy. For each 
of the 1,024 sequences of length 5 base pairs, we computed the 
average nucleosome occupancy of that sequence across all of its 
instances in the genome. We found a near perfect agreement (cor- 
relation of 0.98) between the average occupancy of these 5-base-pair 
sequences in vivo and in vitro (Fig. 3a). Many 5-base-pair sequences 
showed strong preferences for nucleosome-enriched or nucleosome- 
depleted regions. For example, AAAAA has the lowest average 
nucleosome occupancy both in vivo and in vitro, consistent with 
the reduced nucleosome affinity that poly(dA-dT) sequences have 
in vitro’, and with the nucleosome depletion observed over poly(dA- 
dT) sequences in vivo”'*. Consistent with previous reports**"*"°, we 
also found clear ~10-bp periodicities of dinucleotides along the 
nucleosome length, both in vitro and in vivo (Fig. 3b, c). Notably, 
the dynamic range of these periodicities is greater in vitro, suggesting 
that the fraction of nucleosomes positioned by these periodic 
motifs in vitro is greater than that in vivo. This difference may be 
due to the action of chromatin remodellers and transcription factors 
in vivo, which may cause nucleosomes to deviate from the locations 
dictated by the nucleosome sequence preferences. The higher 
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Figure 1| The intrinsic DNA-encoded 
nucleosome organization at a typical genomic 
region. Shown are the four different maps of 


nucleosome occupancy measured in this study 
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for a typical 20,000-bp-long genomic region: the 
in vitro map, which reflects only the intrinsic 
nucleosome sequence preferences, and in vivo 
yeast maps for three different growth conditions 
(YPD, ethanol and galactose). Each track plots 
the measured nucleosome occupancy per base 
pair, computed by summing all of the 
nucleosome reads obtained in that experiment, 
and dividing that number by the average number 
of reads per base pair across the genome. The line 
of y = 1 thus represents the genome-wide average 
and is shown as a dashed orange line. The average 
nucleosome occupancy predictions from our 
model are shown in blue. 
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concentration of nucleosomes in vivo relative to the concentration 
used to create our in vitro map may also contribute to this difference, 
because higher nucleosome concentrations generally increase the 
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Figure 2 | In vitro and in vivo maps are highly similar. a, Shown is a density 
dot plot comparison of the normalized nucleosome occupancy per base pair 
in the in vitro (x axis) and in vivo (y axis) maps (see Methods). Values above 
zero indicate nucleosome enrichment relative to the genome-wide average. 
The colour of each point represents the number of base pairs that map to 
that point in the graph. The Pearson correlation between the maps is 
indicated. b, Nucleosome depletion in vivo relative to in vitro over coding 
regions increases with the expression level of associated genes. Shown is a dot 
plot comparison between the expression level of every yeast gene (measured 
in ref. 26) and the difference between the average normalized nucleosome 
occupancy of the coding region of that gene in the in vitro map compared 
with the in vivo map (that is, higher values indicate larger nucleosome 
depletion in vivo relative to in vitro). The Pearson correlation of the dot plot 
is indicated. 


contribution of non-specific binding, thus diminishing the contri- 
bution of the ~10-bp sequence periodicities. Nevertheless, the 
conservation of the ~10-bp dinucleotide periodicities and the 
near-identity of 5-base-pair nucleosome occupancies demonstrate 
that nucleosomes have clear sequence preferences that are highly 
similar in vitro and in vivo. 

To test whether general sequence-based rules can be derived from 
our in vitro data and be used to predict nucleosome occupancy in 
vivo, we constructed a simple probabilistic model based on both the 
global preferences over sequences of length 5 and the position- 
dependent dinucleotide preferences*'’, which scores the nucleosome 
formation potential of every 147-bp sequence. Importantly, this 
model is learned only from in vitro nucleosome data, and therefore 
represents only nucleosome sequence preferences, whereas previous 
models**’*, which were learned from in vivo data, may also capture 
sequence preferences of other factors’, as well as indirect effects due 
to chromatin remodelling activities. We tested the model in a cross- 
validation scheme in which the nucleosome occupancy of each chro- 
mosome was predicted using a model that was constructed from the 
data from all other chromosomes. Our model has high correlations of 
0.89 and 0.75 with the in vitro and in vivo maps, respectively (Fig. 3d, 
e), and separates nucleosome-enriched regions from nucleosome- 
depleted regions (Supplementary Fig. 3), indicating that the model 
successfully identified general predictive rules for the sequence pre- 
ferences of nucleosomes. 

If nucleosome sequence preferences are important in other eukar- 
yotes, then our model should also be predictive of their in vivo 
nucleosome organization. Indeed, we found a good (0.60) correlation 
between the nucleosome occupancy per base pair predicted by our 
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in vitro yeast-based model and that of in vivo nucleosome occupancy 
in C. elegans'” (Fig. 3f). Moreover, our model classifies nucleosome- 
enriched regions from nucleosome-depleted regions in C. elegans with 
high accuracy (Supplementary Fig. 4), and the 5-base-pair sequence 
preferences of the C. elegans in vivo map agree well with those of the 
yeast in vitro map (Fig. 3g). The poorer classification performance in 
comparison with yeast may indicate that factors other than the DNA 
sequence preferences make a greater contribution to nucleosome 
organization in more complex eukaryotes. Alternatively, the poorer 
performance may indicate that distinct sequence types are present in 
C. elegans for which our yeast in vitro data do not provide statistics. 
Nonetheless, our model is significantly correlated with the in vivo 
nucleosome organization across C. elegans. 

We next compared the DNA-encoded nucleosome organization of 
the in vitro map with nucleosome organization under growth con- 
ditions that cause substantial transcriptional changes relative to log- 
phase growth in rich medium (that is glucose). In addition to our 
map obtained from yeast cells grown in rich medium, we also mea- 
sured the nucleosome organization of yeast cells grown separately in 
galactose, and in ethanol, and found that the overall nucleosome 
occupancy is very similar between all three in vivo maps, although 
localized differences are apparent (Fig. 1 and Supplementary Fig. 5). 
All three in vivo maps are highly correlated with the in vitro map 
and show the sequence characteristics seen in vitro (Supplementary 
Fig. 6). These results imply that intrinsic sequence preferences of 
nucleosomes have a dominant role in determining nucleosome 
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organization in several growth conditions, with local, condition-spe- 
cific changes superimposed. 

To address concerns regarding biases that may be caused by the 
sequence specificity of micrococcal nuclease” and possible biases in 
parallel sequencing, we performed a different kind of in vitro experi- 
ment that measures the relative nucleosome affinity of ~40,000 dou- 
ble-stranded 150-bp oligonucleotides without the use of micrococcal 
nuclease or parallel sequencing. The resulting 5-base-pair nucleo- 
some sequence preferences are in excellent agreement with those 
discovered in the genome-wide in vitro reconstitution (correlation 
of 0.83), and there is a good correlation (0.51) between the measured 
oligonucleotide affinities and those predicted by the model con- 
structed from the genome-wide in vitro map (Supplementary Fig. 
7). These results are wholly independent of either micrococcal nucle- 
ase or parallel sequencing, and thus confirm that the sequence spe- 
cificities derived from our previous experiments were caused by 
intrinsic nucleosome preferences, rather than being an artefact of 
our experimental approach. 

Previous studies identified nucleosome depletion around tran- 
scription start and stop sites>”°"'. However, because these studies 
were based on in vivo data, it was not possible to determine which 
mechanism accounted for the observed patterns. The in vitro and in 
vivo maps show highly similar stereotypic nucleosome depletion at 
translation end sites, indicating that this depletion is largely encoded 
by nucleosome sequence preferences (Fig. 4b and Supplementary 
Fig. 8). The two maps also show stereotypic nucleosome depletion 
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Figure 3 | The in vitro sequence preferences of nucleosomes are highly 
similar to those of nucleosome-bound sequences in vivo and are predictive 
of nucleosome occupancy in C. elegans. a, Comparison of genome-wide 
relative nucleosome occupancy of nucleosomes over sequences of length 5. 
For the in vitro and in vivo maps of nucleosome occupancy, we separately 
computed the average normalized nucleosome occupancy of each of the 
1,024 sequences of length 5, across all of its instances in the genome. Shown 
is a comparison between the distributions of these 5-base-pair sequences in 
both maps. Also shown is the Pearson correlation between these 
distributions. b, Position-dependent sequence preferences of nucleosomes 
in the in vitro map. We aligned the individual nucleosome reads in the in 
vitro nucleosome collection. Shown is the fraction (3-bp moving average) of 
AA/AT/TT/TA and CC/CG/GC/GG dinucleotides at each position of the 
alignment. c, Same as b, for the in vivo map. d, Shown is a density dot plot 
comparison between the normalized nucleosome occupancy per base pair in 
the in vitro map (x axis) and the normalized nucleosome occupancy per base 
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Normalized nucleosome 
occupancy in vivo (C. elegans) 


Average nucleosome occupancy 
on 5-mers in vivo (C. elegans) 


pair predicted by our cross-validated computational model of nucleosome 
sequence preferences (y axis). Values above zero indicate nucleosome 
enrichment relative to the genome-wide average. The colour of each point 
represents the number of base pairs that map to that point in the graph. The 
Pearson correlation between the maps is indicated. e, Same as d, comparing 
our model predictions to the in vivo map. f, In vitro nucleosome sequence 
preferences on yeast genomic DNA are predictive of the in vivo nucleosome 
organization in C. elegans. Same as d, comparing our model predictions and 
the in vivo nucleosome occupancy map of C. elegans on chromosome 2 
(ref. 19). g, Comparison of yeast nucleosome sequence preferences in vitro 
and those of C. elegans in vivo. For each of the maps we separately computed 
the average normalized nucleosome occupancy of every possible sequence of 
length 5. For C. elegans, we performed these computations on 
chromosome 2. Shown is a comparison of these 5-base-pair sequence 
distributions between the yeast in vitro map and the in vivo map of C. 
elegans, along with the Pearson correlation between these distributions. 
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at transcription start sites, indicating that this depletion is also partly 
encoded by nucleosome sequence preferences (Fig. 4a). However, the 
level of depletion around transcription start sites in vitro is smaller 
than in vivo, indicating that transcription factors, chromatin remo- 
dellers, the transcription initiation machinery and other mechanisms 
also contribute to the depletion. Another difference around start sites 
(Fig. 4a) is the longer-range ordering of nucleosomes into coding 
regions observed in the in vivo map. This may be partly explained by 
the higher nucleosome concentration in vivo (see Methods), which 
causes increased ordering by statistical positioning”. 

Nucleosome depletion has also been reported around transcription 
factor binding sites in vivo”*"'. In both our in vivo and in vitro maps, 
nucleosome depletion was observed, on average, around the 
chromatin-immunoprecipitation-determined binding sites” of most 
transcription factors. The maps also agree on the degree of depletion 
around binding sites (correlation of 0.62; Fig. 4c and Supplementary 
Fig. 9). These results indicate that nucleosome depletion around 
regulatory factor binding sites is partly encoded in the genome’s 
intrinsic nucleosome organization, and that this intrinsic organiza- 
tion may facilitate transcription initiation and assist in directing tran- 
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Binding sites for the yeast factors Abfl and Reb1 show the largest 
deviation from the above agreement, with sites for both factors being 
more depleted in vivo than in vitro. Notably, both factors are highly 
abundant and influence chromatin structure™*”’, indicating that the 
depletion around their sites in vivo may be attributable to their own 
action. The large nucleosome depletion over these factor sites 
resulted in them being major components of a nucleosome occu- 
pancy model created from in vivo data’. However, because the model 
constructed here is based solely on in vitro data, in which these sites 
have only moderate depletion, they are not major components of our 
model. 

In summary, we find marked similarities between the nucleosome 
organization governed only by the DNA sequence preferences of 
nucleosomes, and the organization of nucleosomes in vivo measured 
under different growth conditions. This result indicates that nucleo- 
some sequence preferences are important determinants of nucleo- 
some organizations in vivo. Our analysis indicates that gnomes may 
use their intrinsically encoded nucleosome organization to facilitate 
functions such as transcription factor binding and transcription. 
Despite the overall similarity between the in vitro and in vivo maps, 
there are differences, consistent with previous studies showing that 
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Figure 4 | The intrinsic nucleosome organization over transcripts and 
transcription factor binding sites. a, For the in vitro and in vivo nucleosome 
occupancy maps, and for our model, shown is the normalized nucleosome 
occupancy per base pair around the transcription start site, averaged across 
all yeast genes. The long-range ordering of nucleosome occupancy which is 
present in the in vivo maps but not in the in vitro map may be partly 
explained by the lower nucleosome concentration in which the in vitro 
experiment was carried out (see Methods), because higher nucleosome 
concentration in vivo is predicted to cause long-range ordering of 
nucleosome arrays”'. b, Same as a, but around translation end sites of genes 
(translation end was chosen because transcription end sites are poorly 
annotated). The depletion around gene ends may be due to the presence of 
termination signals”’, which disfavour nucleosome formation in vitro 
(Supplementary Fig. 8). The fact that these signals tend to occur in a specific 


YPD — In vitro — 


orientation with respect to the direction of transcription” is consistent with 
a function in transcript processing, but does not exclude the possibility that 
one or more of these motifs functions primarily to disfavour nucleosomes. 
c, Comparison of the nucleosome occupancy over transcription factor 
binding sites between the in vitro and the YPD in vivo maps. For each 
transcription factor with at least 50 functional binding sites”, we computed, 
separately for the in vivo and in vitro maps, the average normalized 
nucleosome occupancy over its binding sites. Shown is a comparison of these 
nucleosome occupancies per factor, between the in vivo and in vitro maps, 
along with the Pearson correlation between them. For six factors taken from 
different regions of the plot, we also show the average normalized 
nucleosome occupancy around those factors’ binding sites, for both the in 
vitro and the in vivo maps. 
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factors other than nucleosome sequence preferences contribute to 
nucleosome organization in vivo. Future studies will focus on under- 
standing how nucleosomes are remodelled locally and the function of 
such remodelling in transcriptional regulation. 


METHODS SUMMARY 


In vivo maps from yeast nucleosome DNAs were prepared from log-phase cells 
grown in rich medium (YPD, six independent replicates) as described prev- 
iously*, as well as from cells grown in YP media supplemented with 2% galactose 
(three replicates) or 2.8% ethanol (four replicates) instead of glucose. The result- 
ing DNAs were subjected to Illumina sequencing-by-synthesis. For the in vitro 
map, histone octamer was purified from chicken erythrocytes, assembled on 
purified yeast genomic DNA by salt gradient dialysis'*, digested with micrococcal 
nuclease and subjected to Illumina sequencing (two independent replicates). The 
resulting in vitro map has a lower concentration of nucleosomes along the DNA 
than obtained in vivo. This technical limitation was necessitated by our finding 
that reconstitutions at the in vivo stoichiometry on long genomic DNA resulted 
in insoluble chromatin that was inaccessible to micrococcal nuclease. We 
mapped the resulting reads to the genome and removed reads that mapped to 
multiple genomic locations. We extended the nucleosome reads of each experi- 
ment to the average nucleosome length in that experiment (always between 140- 
170 bp). For each map, we then calculated the normalized nucleosome occu- 
pancy at every base pair as the log-ratio between the number of reads that cover 
that base pair and the average number of reads per base pair across the genome. 
We then set the genomic mean in each sample to zero by subtracting the genome- 
wide mean from every base pair. The independent replicates for each experiment 
type were in excellent agreement, so we averaged the replicates within each type. 
The resulting tracks are termed normalized nucleosome occupancy throughout 
the manuscript. The detailed formulation of our sequence-based model for 
nucleosome positioning is given in the Methods and is similar to that described 
in ref. 17, except that it was learned using only the in vitro data. For our data, 
results and model, see http://genie.weizmann.ac.il/pubs/nucleosomes08/, and 
GEO accession number GSE13622. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Parallel sequencing of yeast nucleosomes in vivo. Four samples of yeast nucleo- 
some DNAs (‘in vivo nucleosomes’) were prepared from log-phase cells grown in 
rich medium (YPD) as described previously’. Additionally, samples were pre- 
pared from cells grown in YP media supplemented with 2% galactose (two 
samples) or 2.8% ethanol (two samples) instead of glucose. Five additional 
samples (two in YPD, one in galactose and two in ethanol) were prepared from 
cells that were formaldehyde-crosslinked before isolating nucleosomes. 
Crosslinking was achieved with 1% formaldehyde for 15min at 30°C and 
stopped with 125 mM glycine. The cells were then washed, made into sphero- 
plasts with lyticase as above, treated with micrococcal nuclease (MNase) as 
described previously’*, and nucleosomal DNAs isolated as described'®. All of 
the resulting DNAs were subjected to Illumina/Solexa sequencing-by-synthesis. 
The results from crosslinked or uncrosslinked cells for a given growth medium 
were very similar; therefore, we averaged the replicates within each growth 
medium (six YPD, four YPEtOH, three YPGal). 

Parallel sequencing of nucleosomes reconstituted in vitro on yeast genomic 
DNA. S. cerevisiae genomic DNA was purified from strain YLC8 (MATa ura3A 
leu2A his3A met15A) using standard methods with additional steps to remove 
contaminating RNA (see Supplementary Information). Histone octamer was 
purified from chicken erythrocytes using salt extraction and hydroxyapatite 
column chromatography, as described previously**. Genomic DNA was recon- 
stituted into nucleosomes under selective pressure for nucleosome-favouring 
sequences by salt gradient dialysis'’, using 40 ug histone octamer plus 100 pg 
DNA in a 200 pl volume. This results in a lower concentration of nucleosomes 
along the DNA than obtained in vivo. This technical limitation was necessitated 
by our finding that reconstitutions at the in vivo stoichiometry on this long 
genomic DNA resulted in insoluble chromatin that was inaccessible to micro- 
coccal nuclease. Two independent reconstitutions were carried out. The result- 
ing nucleosomes were biochemically isolated by MNase digestion, using 
6X10 * units MNase (Sigma Chemical Company) per 101g competitively 
reconstituted DNA, in 10mM Tris, pH 8.0, and 1mM CaCl, for 5 min at 
37°C. DNA was extracted, and protected fragments of length ~147 bp were 
isolated by polyacrylamide gel electrophoresis, and then extracted from the 
gel. Both samples were independently subjected to Illumina sequencing at the 
UNC high-throughput sequencing facility. We note that, because the chicken 
histones represent a mixture of post-translationally modified and unmodified 
histones as well as variant histones, there is a possibility that these modifications 
or variants influence the measured nucleosome positions. 

Mapping and post-processing of parallel sequencing reads. We mapped reads 
resulting from sequencing experiments to the yeast genome, allowing at most 
one mismatch and no gaps. To estimate the mean DNA fragment length in each 
experiment, we superimposed the nucleosome reads of one strand and examined 
the distribution of nucleosome reads of the opposite strand. As expected, this 
distribution showed a strong peak at ~140-170 bp for all experiments, with 
slight variations between experiments. We used the maximum of the peak as 
an estimation of the mean DNA fragment length and extended all nucleosome 
reads to this length. We defined repetitive regions as regions that were matched 
by a read that mapped to more than one place in the genome. We excluded 
repetitive regions and their 150-bp vicinity from our analyses. To obtain geno- 
mic nucleosome occupancy tracks we summed for each position all reads cover- 
ing it. We noticed that in some replicates from each of the three in vivo 
conditions (YPD, YPEtOH and YPGal), but not in either of the in vitro replicates, 
a small fraction of the mapped base pairs (in all cases less than 0.5%) were 
covered by an aberrantly large number of reads (more than ten times the median 
genomic base pair coverage) that may potentially result from sequencing biases. 
For these regions, we set the coverage of these base pairs to equal ten times the 
median genomic base-pair coverage. Next, for each map, we calculated the 
average nucleosome occupancy at every base pair by dividing the number of 
reads that cover that base pair by the average number of reads per base pair across 
the genome. Finally, for each replicate we took the log of the ratio at every base 
pair and set the genomic mean to zero by subtracting the mean from every base 
pair. Thus, base pairs for which the occupancy value is below 0 showed a relative 
depletion of nucleosomes in comparison with the genome-wide average, whereas 
base pairs with values above 0 have a relative enrichment. The independent 
replicates for each experiment type were in excellent agreement, so we averaged 
the replicates within each type to create four nucleosome occupancy maps: three 
in vivo (YPD, YPEtOH, YPGal) and one in vitro. These final tracks are termed the 
normalized nucleosome occupancy. The data in Fig. 1 are presented without the 
log transformation. 

Analysis of nucleosomes reconstituted on oligonucleotides. Two concerns 
with our experimental approach were the known sequence specificity of micro- 
coccal nuclease” and the possibilities of biases introduced by the parallel 
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DNA-sequencing strategy. We addressed both of these concerns simultaneously 
with an independent and different kind of experiment that obtains large-scale 
measurements of in vitro nucleosome sequence preferences without the use of 
micrococcal nuclease or parallel sequencing. In brief, we created a pool of 
~40,000 double-stranded oligonucleotides of length 150 bp, each flanked by 
common priming sites’, and combined the pool with limiting amounts of 
chicken histones to form nucleosomes. DNAs in reconstituted nucleosomes were 
separated from unincorporated DNAs by native gel electrophoresis, without the 
use of any micrococcal nuclease. We then extracted from the gel the DNA that 
had successfully competed to form nucleosomes, and used both parallel sequen- 
cing and microarrays to compare the nucleosomal DNAs to DNAs in the initial 
pool. For each sequence, we calculated the log-ratio between the reconstituted 
fraction and the initial pool as a measure of the nucleosome affinity of that 
sequence. Thus, a log-ratio above zero indicates a higher than average affinity 
to form nucleosomes, whereas a value below zero indicates a lower than average 
affinity. We note that this experiment is inherently noisy, as shown by a large 
variation in the number of sequence reads in the unselected reference sample, 
which may be due to the use of two rounds of PCR in the procedure. 

Specifically, we created 197-mer oligonucleotides (containing 150-mer 
unique sequences flanked by common priming sites) using a modification of 
the microarray-based method described previously”. Sequences were designed 
to represent tiled yeast genomic sequences (22,236 oligonucleotides), and a 
survey of randomly generated sequences selected to represent a wide variety of 
DNA sequence characteristics (nucleotide and dinuncleotide composition, peri- 
odic features and DNA structural parameters’). DNA obtained from the array 
was PCR-amplified and assembled into nucleosomes under selective pressure for 
nucleosome-favouring sequences by salt gradient dialysis’, using 6 ug histone 
octamer and 12 1g DNA in a 200-pl volume. The resulting nucleosomes were 
separated away from the remaining naked DNA by native polyacrylamide gel 
electrophoresis (5% (w/v) polyacrylamide) in 1/3 TBE buffer (TBE is 90 mM 
Tris, 90 mM boric acid, 2 mM EDTA, pH 8.3). The nucleosome-containing band 
was excised from the gel, and its DNA extracted by crushing and soaking into 
0.3 M NaOAc, 0.5 M NH,OAc, 0.1 mM EDTA and 0.1% (w/v) SDS, followed by 
ethanol precipitation. For Illumina analysis, the pools before and after nucleo- 
some assembly were re-amplified and Nrul sites present in the primers were 
cleaved before sequencing from the ends. In our analyses, we used oligonucleo- 
tides that were sequenced at least once and at most 500 times in each experiment. 
For microarray analyses, the pools before and after assembly were re-amplified 
with fluor-labelled primers, and hybridized to an Agilent array designed to detect 
the individual sequences represented in the pools. Additional details are available 
in the Supplementary Information. 

To remove any direct biases resulting from the primers in the computation of 
preferences over sequences of length 5 from these data (Supplementary Fig. 7), 
we omitted the primers altogether in the computation of 5-base-pair preferences 
(that is, all 5-base-pair sequences were extracted only from the variable oligo- 
nucleotides without the primers). This left open the possibility of indirect biases 
in preferences of sequences of length 5 resulting from the primers; for example, if 
one primer sequence attracted nucleosomes and if there were systematic biases in 
compositions of sequences of length 5 near that end of the designed oligonucleo- 
tides. However, half (~20,000) of the oligonucleotides were selected from the 
yeast genome, and the other half were designed in several unrelated ways (see 
Supplementary Methods) and do not have such biases. The two approaches are 
in strong agreement regarding these preferences over sequences of length 5 base 
pairs, diminishing concerns regarding biases caused by the primers (the correla- 
tions between the preferences computed from the first approach and the second 
approach are 0.81 and 0.83 for reading out the measurements of the second 
approach by parallel sequencing or microarrays, respectively; see 
Supplementary Fig. 7). 

Data sets. The yeast genome sequence (May 2006 build) and gene and chro- 
mosome annotations were obtained from SGD”. Yeast transcription start sites 
were compiled from refs 31-33: for each gene, the transcription start site was 
taken as that with the most sequence reads from refs 31 and 32, or from ref. 33 
when no sequencing data was available. Functional transcription factor DNA 
binding sites in yeast, defined as sites that are bound by their cognate transcrip- 
tion factor were obtained from refs 22 and 34. Expression levels in yeast were 
taken from ref. 26. The in vivo map of nucleosome organization in C. elegans was 
taken from ref. 19; specifically, we used both the ‘adjusted nucleosome coverage’ 
and ‘raw nucleosome coverage’, applying the same normalization method to the 
latter as we did to our yeast nucleosome maps". 

Classifying nucleosome-enriched regions from nucleosome-depleted regions. 
To test whether the two maps agree on the scale of individual nucleosomes, we 
took the target map, and from it defined nucleosome-enriched regions as max- 
imal consecutive regions longer than 50 bp for which minimum occupancy at 
every base pair is above some threshold, ¢,, and nucleosome-depleted regions as 
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maximal consecutive regions longer than 50 bp for which maximum occupancy 
at every base pair is below some threshold, ty. We chose 50 bp as a compromise 
between partitioning as much of the genome as possible into nucleosome- 
enriched and nucleosome-depleted regions while still deriving each partition 
from enough data to reduce noise that may arise when regions are too short (a 
choice of 25 bp yielded equivalent results). At various thresholds, we then scored 
each of the resulting nucleosome-enriched and nucleosome-depleted regions by 
its average occupancy in the predicting map. Note that at the threshold 
t. = tq = 0, the evaluation is done across the entire genome (typically ~97%, 
because regions <50 bp are excluded), and the median length of the nucleo- 
some-enriched regions is typically 150-190 bp. If the predicting map were fully 
predictive of the target map, then the predicting map occupancy score of every 
‘enriched’ region in the target map would be higher than that of every ‘depleted’ 
region in the target map. A standard quantification of this predictive power is the 
receiver operating characteristic curve, whose area under the curve is 1 for perfect 
performance and 0.5 for random guessing. 

Sequence-based model for nucleosome positioning. We use the in vitro map to 
represent the sequence preferences of nucleosomes by devising a probabilistic 
model, similar in formulation to that proposed in ref. 17, that assigns a score to 
every 147-bp (nucleosome-length) sequence. Our model is based on both the 
position-dependent and global preferences of sequences of length 5 that we 
characterized previously (Fig. 3). The first component, Py, represents the dis- 
tribution over dinucleotides at each position along the nucleosome length, and 
thus captures the periodic signal of dinucleotides along the nucleosome. The 
second component, P,, represents the position-independent distribution of 
nucleosomes over sequences of length 5, and thus captures sequences that are 
generally favoured or disfavoured by nucleosomes regardless of their detailed 
position within the nucleosome. We chose to represent this component using 
sequences of length 5, because this is the highest order k-mer for which our data 
have sufficient statistics to robustly estimate each of the associated parameters. 
The final score that our model assigns to a 147-bp sequence Sis then given by the 
log-ratio of these two model components: 


Py(S) 
P.(S) 


Score(S) = log 


147 
Px. (S[1])LL Py Sf] S[i—1]) (1) 
= log 17 = 


A(S[1) LA Sfi]] Sfmax (1,i—4)], ...,5[i—1]) 


where Py; is the i-th component of the dinucleotide model component and 
represents the conditional probability distribution over nucleotides at position 
i given the nucleotide that appeared at position (i— 1), and P, is the position- 
independent component of the second component of our model (P,). Note that 
Py, is represented by a mononucleotide model over the nucleotide at the first 
position. 

We now describe in detail how each of the two components of our model was 
derived. To estimate the position-specific dinucleotide component of our 
model, Px, we took all of the nucleosome reads (a total of over 10,000,000) from 
our collection of nucleosome-bound sequences, and first aligned these sequences 
about their centre. Each sequence was added twice to the alignment, once in its 
original form and once in its reverse complement form, to account for the two- 
fold symmetry in the nucleosome structure”. With each position i, we then 
associated a dinucleotide distribution, Py ;, which we estimated from the com- 
bined dinucleotide counts at alignment positions [i— 2, i— 1], [i— 1, i] and 
[i, i+ 1] (the two end positions of the nucleosome are averaged with fewer 
positions). Combining the dinucleotides at the two neighbouring positions 
smoothes the resulting dinucleotide distribution at each position with a 3-bp 
moving average, and is motivated by the experimental evidence that small + 1-bp 
changes in spacing of key nucleosome DNA sequence motifs can occur with a 
relatively small cost to the free energy of histone-DNA interactions**. To remove 
sequence composition biases from this component, we normalized the distri- 
bution over each position separately, by dividing the final probability of every 
dinucleotide at each position by the average probability of that dinucleotide 
across all positions, and finally normalized the resulting dinucleotide weights 
to a probability distribution. We used this estimation procedure in the 127 
central positions of the nucleosome, and we forced a uniform distribution over 
the 10 remaining positions at each end of the nucleosome profile. This was done 
to avoid biases in nucleotide distributions that may arise from the sequence 
specificity of the micrococcal nuclease used to isolate the nucleosome, because 
this way we do not include statistics that are taken from the cut site of the 
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nuclease. Note that our above construction produces a reverse complement 
symmetrical distribution; that is, the probability of a sequence and its reverse 
complement are equal by construction. 

The position-independent component of our model, P,, the purpose of which 
is to represent sequences that are generally favoured or disfavoured regardless of 
their position within the nucleosome, assigns a score to each 147-bp sequence, as 
the product of a position-independent Markov model, Pj, of order 4. Thus, P; 
defines a probability distribution over every one of the 1,024 possible sequences 
of length 5. We chose to model the distribution over sequences of length 5, 
because this is the highest order for which our data still provides sufficient 
statistics to robustly estimate the value of each of the 1,024 parameters. Given 
a map of nucleosome positions, the probability of each sequence of length 5 is 
equal to the reciprocal of the average nucleosome occupancy of that sequence 
across all instances of the 5-base-pair-long sequence in the map, where this 
reciprocal average occupancy is then scaled to a probability by dividing it by 
the sum of all such reciprocal occupancies across all 5-base-pair-long sequences 
(we use the reciprocal of the average occupancy so that higher probabilities 
correspond to disfavoured sequences). Thus, this Markov model, Pj, includes 
contributions both from sequences that are disfavoured by nucleosomes and 
from sequences that are favoured by nucleosomes, because it models the distri- 
bution over all sequences of length 5, with the disfavoured sequences having a 
relatively high probability and the favoured sequences having a relatively low 
probability. 

The above probabilistic model assigns a nucleosome formation score to each 
sequence of (nucleosome-length) 147 bp. We then followed the approach of 
ref. 5, and used the scores of this model to compute the genome-wide distri- 
bution over nucleosome positions, taking into account steric hindrance con- 
straints between neighbouring nucleosomes. These computations can be done 
efficiently using dynamic programming, and their output is a track of the average 
nucleosome occupancy per base pair, that is, the probability that each base pair in 
the genome is covered by any nucleosome. 

In developing our model, we also separately evaluated the performance of the 
Py and Py, components, and found that the performance of the P,, component 
alone is nearly the same as that of the full model (per-base-pair nucleosome 
occupancy correlation of 0.876 between a P,-only model and the in vitro map, 
compared with 0.880 for the full model using both P, and Py), whereas the 
performance of the Py component alone is highly predictive but slightly worse 
than the full model (correlation of 0.820). Thus, for practical purposes of gen- 
erating predictions, using only the P, component may suffice. Nevertheless, the 
Py component captures real periodic aspects of nucleosome sequence prefer- 
ences that have been reported by several other groups*®’’, and that we now find 
in the genome-wide in vitro nucleosome map in Fig. 3b. For these reasons, we 
retain the Py component as part of the model. 

For the synthetic oligonucleotide analysis (Supplementary Fig. 7), we used the 
nucleosome formation potential of equation (1). In all other analyses we con- 
verted this genome-wide nucleosome formation potential into genome-wide 
nucleosome occupancy per base pair, as described in ref. 5. We selected the 
concentration and temperature parameters (t and f3, respectively) in a cross- 
validation manner to be t = 0.03, f = 1 in yeast and t = 0.1, B = 1 in C. elegans. 
Finally, to appropriately compare the model with log-transformed data, we 
applied to the predictions of the model the same log normalization that was 
applied to the data. 
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Transmembrane passage of hydrophobic compounds 
through a protein channel wall 


Elizabeth M. Hearn’, Dimki R. Patel’, Bryan W. Lepore’, Mridhu Indic’ & Bert van den Berg’ 


Membrane proteins that transport hydrophobic compounds have 
important roles in multi-drug resistance’? and can cause a num- 
ber of diseases**, underscoring the importance of protein- 
mediated transport of hydrophobic compounds. Hydrophobic 
compounds readily partition into regular membrane lipid 
bilayers®, and their transport through an aqueous protein channel 
is energetically unfavourable’. Alternative transport models invol- 
ving acquisition from the lipid bilayer by lateral diffusion have 
been proposed for hydrophobic substrates**”-”. So far, all trans- 
port proteins for which a lateral diffusion mechanism has been 
proposed function as efflux pumps. Here we present the first 
example of a lateral diffusion mechanism for the uptake of hydro- 
phobic substrates by the Escherichia coli outer membrane long- 
chain fatty acid transporter FadL. A FadL mutant in which a lateral 
opening in the barrel wall is constricted, but which is otherwise 
structurally identical to wild-type FadL, does not transport sub- 
strates. A crystal structure of FadL from Pseudomonas aeruginosa 
shows that the opening in the wall of the f-barrel is conserved and 
delineates a long, hydrophobic tunnel that could mediate sub- 
strate passage from the extracellular environment, through the 
polar lipopolysaccharide layer and, by means of the lateral open- 
ing in the barrel wall, into the lipid bilayer from where the sub- 
strate can diffuse into the periplasm. Because FadL homologues 
are found in pathogenic and biodegrading bacteria, our results 
have implications for combating bacterial infections and bioreme- 
diating xenobiotics in the environment. 

The outer membrane of Gram-negative bacteria provides an effi- 
cient barrier for the passage of hydrophobic molecules owing to the 
presence of the polar lipopolysaccharide (LPS) layer on the outside of 
the cell. The only protein family currently known to be involved in 
the uptake of hydrophobic molecules is the FadL family, named after 
the archetypal long-chain fatty acid (LCFA) transporter FadL of 
E. coli'*"*. The crystal structures of E. coli FadL’’ revealed a 
monomeric 14-stranded B-barrel with an interior occluded by an 
amino-terminal hatch domain, and an inward-pointing kink in 
B-strand S3 that creates an unusual, lateral opening in the transmem- 
brane barrel'°. On the basis of the FadL structures, two possible LCFA 
transport mechanisms can be envisioned, both of which are 
diffusion-based because FadL-mediated transport does not require 
an energized membrane". In the ‘classical’ transport model, the 
hatch undergoes conformational changes creating a transient 
channel for LCFA transport transverse to the membrane, from the 
extracellular medium directly to the aqueous periplasm’. The 
alternative ‘lateral diffusion’ transport model is based on the obser- 
vation that an LCFA-mimicking LDAO detergent molecule is bound 
in the lateral opening of E. coli FadL (Fig. 1a, b). In the lateral dif- 
fusion model, LCFAs exit FadL laterally through the opening in the 
barrel wall to move into the outer membrane, from where they could 
diffuse into the periplasm. To answer the question regarding which 


transport model is operative for FadL, we designed a number of FadL 
mutants focusing on the hatch domain and the lateral opening. 

Two FadL mutants, AS3 kink and A77E/S100R, were designed to 
close specifically the barrel wall opening in different ways. In AS3 
kink, four residues in strand S3 that line the lateral opening 
(°°SNYG'"?) were replaced with three residues (AND) to bring 
strand $3 into register with the neighbouring strands and to desta- 
bilize the conformation of the $3 kink. In A77E/S100R, residues with 
long, oppositely charged side chains were introduced for residues 
Ala77 and Ser 100, which are located in the barrel wall on different 
sides of the opening (Fig. 1c) with their side chains pointing towards 
each other. We anticipated that the introduced glutamic acid and 
arginine might form a salt bridge, closing the lateral opening. We 
determined the crystal structures of the AS3 kink and A77E/S100R 
mutants (Supplementary Table 1). The AS3 kink mutant, although 
largely identical to wild-type FadL, lacks density for several extracel- 
lular loops (Supplementary Fig. 1). However, the A77E/S100R struc- 
ture is identical to that of wild-type FadL (Fig. 1d, e), with the sole 
exception of the lateral opening, which is much smaller in the A77E/ 
S100R mutant (Fig. 1c). Thus, the introduced mutations for residues 
Ala 77 and Ser 100 are unlikely to affect the interior of the protein, 
strongly arguing against an inhibitory effect of the mutations on a 
possible conformational change in the hatch that would form a clas- 
sical transport channel. Crucially, both the AS3 kink and A77E/ 
S100R mutants were completely inactive for in vivo LCFA uptake 
and growth on palmitate (Fig. 2). Moreover, the low transport acti- 
vity (~3%) of the S100R single mutant (Fig. 2) demonstrates that the 
introduction of a single long side chain is sufficient to inhibit LCFA 
uptake efficiently. The structural and biochemical data together 
demonstrate that constricting the lateral opening in the FadL barrel 
wall is sufficient to block LCFA uptake. Although we can not exclude 
the possibility that subtle changes in barrel structure and dynamics, 
introduced by the mutations, affect the formation of a lateral opening 
in other parts of the barrel, our data are most easily explained by a 
transport model in which LCFAs diffuse laterally from the lumen of 
the barrel, through the observed opening in the barrel wall, into the 
outer membrane. 

If transport by lateral diffusion is a general feature of FadL channels, 
then the opening in the barrel wall should be structurally conserved. To 
test this notion we have determined the crystal structure 
(Supplementary Table 2) of a FadL homologue from Pseudomonas 
aeruginosa (PaFadL), which has low (20%) sequence identity to E. coli 
FadL (EcFadL). Despite the modest sequence identity, PaFadL is struc- 
turally similar to EcFadL, with the exception of a number of extra- 
cellular loops (Fig. 3a, b). PaFadL has a pronounced lateral opening in 
the barrel wall at the same location as in EcFadL (Fig. 3a), indicating 
that the lateral opening is conserved in FadL family members. The 
most notable feature of the PaFadL structure is the presence of well- 
defined density for three complete tetraethylene glycol monooctyl 
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Figure 1| Structural features of FadL mutants. a, b, Ribbon diagram of 
wild-type FadL highlighting the S3 kink, viewed laterally (a) and from the 
extracellular side (b). An LDAO molecule (dark blue) protrudes through the 
opening in the barrel wall between strands $2 and S3 (dark pink). The hatch 
domain is coloured yellow. The interface boundaries of the outer membrane 
bilayer are indicated by horizontal lines. c, Surface views of the kink region 
(with strands $2 and $3 indicated) of wild-type FadL (left), AS3 kink 


ether (CgE,) detergent molecules inside the barrel lumen (Fig. 3c). The 
first two detergent molecules (1, 2) are present at positions that are 
analogous to those occupied by detergent molecules in EcFadL’’. The 
third CgE, molecule (3) is located in the lateral opening, analogous to 
the LDAO molecule in wild-type FadL (Fig. la, b). The presence of a 
detergent molecule in the lateral opening indicates that this part of the 
potential substrate passageway has a substantial affinity for LCFA 
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Figure 2 | Functional analysis of FadL proteins. Specific activity expressed 
relative to wild type for oleate uptake measured in whole cells at 20 °C. Bars 
represent the average of at least four independent measurements, and the 
error bars represent the standard deviation. Growth on palmitate minimal 
medium plates at 37 °C, ranging from wild-type levels (+ ++) to no growth 
(—), is indicated above the bars. The hatch mutants N33A and ANPA do not 
support growth of E. coli on palmitate at 37 °C, because the expression levels 
of these two mutants are below detection limits at this temperature. The 
bottom panel shows representative western immunoblots indicating the 
expression levels of the various proteins in the outer membrane at 20 °C. Tsx 
is the E. coli outer membrane nucleoside transporter, which is included as an 
additional negative control. 


368 


NATURE|Vol 458|19 March 2009 


(middle) and A77E/S100R (right), showing the smaller lateral opening in 
AS3 kink and A77E/S100R. The side chains for the residues at positions 77 
and 100 are shown. d, e, Backbone superpositions of wild-type FadL 
(orange) and A77E/S100R (green; d, hatch domains only). In e, 2F,—F, 
density (contoured at 1.50) is shown as a blue mesh for segments of 
B-strands S2 and S3 in A77E/S100R, with the side chains of residues Glu 77 
and Arg 100 indicated. 


substrates. This is borne out by the fact that the environment of the 
detergent molecule in the opening is largely hydrophobic 
(Supplementary Fig. 2), like the other LCFA-binding sites in FadL">. 

The three detergent molecules in PaFadL clearly delineate a long 
(~55 A) passageway that runs from the extracellular surface all the 
way to the lateral opening in the barrel wall. Of the ~50 residues that 
are located within 4.5 A of the detergent molecules, more than 80% are 
hydrophobic (Supplementary Fig. 2). Moreover, structural and 
sequence alignments show that the hydrophobic character of these 
residues is conserved. The hydrophobic channel is continuous, and is 
interrupted only by the first three amino-terminal residues of the 
hatch, most notably Phe 3. Remarkably, a conformational change of 
the N terminus, as previously observed in EcFadL”’, would generate an 
uninterrupted, hydrophobic passageway for substrate diffusion all the 
way from the extracellular medium to the lateral opening 
(Supplementary Fig. 3). As in EcFadL (Fig. la), the lateral opening 
in PaFadL is situated in the region of the polar—apolar interface of the 
outer leaflet of the outer membrane (Fig. 3c). This location makes 
sense, because it probably provides a favourable environment for both 
the carboxylate head group and the hydrocarbon chain as the LCFA 
emerges from the FadL lumen. Our observation is in agreement with 
recent data obtained for multidrug efflux pumps, where a range of 
hydrophobic substrates was shown to be preferentially localized in the 
interface region of the lipid bilayer®. Despite the structural similarity 
to EcFadL, PaFadL has very low activity for LCFA transport (Fig. 2), 
confirming that FadL channels are substrate specific’’. 

To support our lateral diffusion model, we also asked whether the 
hatch domain of EcFadL is flexible. The hatch NPA sequence (residues 
33-35) is absolutely conserved and is a signature of FadL channels. 
The Asn 33 amide side chain of the NPA sequence forms a hydrogen 
bond with the Gly 21 backbone carbonyl, bringing distant parts of the 
hatch together (Supplementary Fig. 4). The removal of this hydrogen 
bond would seem a good candidate to probe whether the hatch 
domain can undergo conformational changes. We determined the 
X-ray crystal structures of the mutants N33A, P34A and ANPA 
(N33G/P34G/A35G; Supplementary Table 1), and found that they 
are very similar to wild-type FadL (Supplementary Figs 1 and 4) 
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Figure 3 | A hydrophobic passageway for substrate diffusion in PaFadL. 
a, Superposition of EcFadL (green) and PaFadL (red), showing the 
conservation of the lateral opening. b, Superposition of the hatch domains. 
c, Stereo side view of PaFad_L, with the three bound CE, detergent molecules 


and do not have a hatch channel. The same is true for the G212E 
mutant (Supplementary Figs 1 and 4), which previously was proposed 
to have an open channel’*. Consistent with the structural data, all 
hatch mutants are active in oleate transport (Fig. 2). The combined 
structural and biochemical data indicate that the NPA sequence is not 
directly involved in substrate transport, but it may be important for 
proper folding or outer membrane targeting of FadL. In addition, it 
appears that the hatch domain is rigid, providing support for the 
lateral diffusion transport model. It should also be noted that, 
although the hatch domain of EcFadL has a number of hydrophobic 
residues, these are interspersed with many polar residues 
(Supplementary Fig. 5). Therefore, even if a hatch channel in 
EcFadL could form by spontaneous conformational changes, it would 
not provide a suitably hydrophobic conduit for LCFA transport. 

We now propose a general, lateral diffusion mechanism for the 
uptake of hydrophobic substrates by the FadL outer membrane protein 
family (Fig. 4). According to this mechanism, hydrophobic substrates 
diffuse by means of a long hydrophobic passageway laterally into the 
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Figure 4 | Proposed lateral diffusion model for the uptake of hydrophobic 
substrates by FadL proteins. a, Substrate (red hexagon) capture from the 
extracellular medium by a low-affinity binding site (L)"*; b, diffusion of the 
substrate into an adjacent high-affinity binding site H (blue)"’; 

c, spontaneous conformational changes in the N terminus (purple) result in 
substrate release and create a continuous passageway to the barrel wall 
opening formed by the kink in strand $3. The substrate diffuses laterally 
through the opening into the outer membrane (OM). The polar part of the 
LPS, constituting the principal barrier in the transport process, is shown in 
grey. The extracellular milieu (E) is at the top and the periplasm (P) is at the 
bottom. 
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indicated in red. 2F.,— F. density is shown as a blue mesh, contoured at 2.0. 
The hatch domain is coloured green. The belts of aromatic residues that 
delineate the polar—apolar interfaces of the outer membrane are shown as 
orange stick models. 


outer leaflet of the outer membrane, probably by means of a stable, 
lateral opening in the barrel wall. From the outer leaflet the substrate 
can move to the inner leaflet of the outer membrane and diffuse into 
the periplasm. By using this mechanism, the hydrophobic substrates 
bypass the hydrophilic LPS layer of the outer membrane without 
having to move through an aqueous channel, which would be ener- 
getically unfavourable owing to the extremely low aqueous solubilities 
of FadL substrates (<0.1nM for palmitate'’). Intriguingly, medium- 
chain fatty acids do not require FadL for uptake in E. coli”°. Apparently 
these relatively water-soluble substrates (>0.5mM for laurate'’) use 
other outer membrane channels for uptake, probably porins. Recently, 
crystal structures of Pseudomonas putida TodX and Ralstonia pickettii 
TbuX were determined, two members of a subfamily of FadL proteins 
from biodegrading bacteria involved in the uptake of mono-aromatic 
hydrocarbons*!”*. As expected, TodX and TbuX both have a kink in 
strand S3 as well as a lateral opening at the same position as in FadL”, 
providing additional support for the generality of the lateral diffusion 
mechanism in FadL proteins. However, TodX and TbuX also have a 
narrow, continuous channel through the hatch domain that might 
serve as a classical channel. The TodX/TbuX hatch channel is relatively 
polar’’, and may have evolved for the uptake of relatively water-soluble 
(~1-5mM)” mono-aromatic hydrocarbons in bacteria lacking 
porins. Future experiments will be required to establish whether the 
hatch channel in TodX/TbuX contributes to transport. 

Structural features resembling the lateral opening of FadL channels 
have been observed in two other unrelated outer membrane proteins, 
PagP and OmpW. The lateral opening in the lipid A palmitoyl-trans- 
ferase PagP may allow access of the lipid substrate to the active site of 
the enzyme. OmpW belongs to a widespread family of eight- 
stranded B-barrels*’, with members present in operons dedicated 
to the degradation of hydrophobic molecules such as naphthalene 
(NahQ)** and alkanes (AlkL)”” in pseudomonads, indicating that 
they may form uptake channels for these compounds. Such uptake 
would have to occur by lateral diffusion, because the eight-stranded 
barrel lumen is too narrow to form a classical channel**. Thus, bac- 
terial substrate uptake by lateral diffusion into the outer membrane 
may be a widespread phenomenon. Taking into account the known 
occurrence of lateral diffusion in multidrug transporters, this mode 
of transport probably represents a universal mechanism for mem- 
brane proteins that modify and transport hydrophobic substrates. 


METHODS SUMMARY 


LCFA functional assays were performed using a modified approach from that 
described previously'*. FadL mutant proteins were overexpressed and purified 
from E. coli C43 (DE3) as described for the wild-type protein’’. Crystals were 


369 


©2009 Macmillan Publishers Limited. All rights reserved 


LETTERS 


obtained by the hanging-drop method, and structures were solved by molecular 
replacement using as a search model the monoclinic FadL structure (Protein 
Data Bank accession code 1T16) in which the hatch domain and the kink were 
deleted. PaFadL was overexpressed in native form in E. coli C43 (DE3), and 
purified as EcFadL. The PaFadL crystal structure was determined by the multiple 
isomorphous replacement with anomalous scattering (MIRAS) method, using a 
gold and osmium derivative obtained by the quick soaking method”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

FadL mutagenesis. The E. coli fadL gene, including the signal sequence and a 
carboxy-terminal hexahistidine tag, was cloned as previously described" into the 
pBAD22 vector, which is under the control of the arabinose-inducible pro- 
moter”. Mutations were introduced into the fadL gene by using the 
QuikChange Site-directed Mutagenesis kit (Stratagene), and the mutations were 
verified by nucleotide sequencing. 

LCFA functional analyses. For the functional analyses, E. coli LS6164 
AfadR AfadL” was transformed with the pBAD22 plasmids carrying the mutant 
fadL genes and FadL homologues. As a negative control, the pBAD22 plasmid 
carrying the outer membrane nucleoside transporter Tsx*' was introduced into 
E. coli LS6164. The ability of the mutant proteins to support growth on LCFAs 
was measured by plating cells (5 X 10° colony-forming units per ml) on agar 
plates containing 5mM sodium palmitate (Sigma), 0.5% (w/v) Brij 58, M9 
minimal medium, and 1.5% (w/v) Noble agar (Difco). Growth was scored after 
96-h incubation at 37 °C. 

Transport assays with radiolabelled oleic acid were performed with modifica- 

tions to the method described previously". E. coli LS6164 AfadR AfadL cells with 
the FadL mutant and pBAD22 plasmids were grown in Luria Bertani medium to 
mid-log phase, and FadL expression was induced with 0.01% (w/v) arabinose at 
20°C for 5h. Cells were collected, washed in EB1 buffer (10 mM citric acid, 
0.8mM magnesium sulphate, 20mM sodium ammonium phosphate, 60 mM 
potassium phosphate, 0.02 mM thiamine), and resuspended to a Doo of 1 in EB1 
buffer with 0.5% (w/v) Brij 58. After 30 min starvation at 37 °C, samples were 
taken for western immunoblotting analysis and also for measurement of oleic 
acid transport activity. The transport assay was performed by diluting the cells in 
EB1 buffer containing 20mM glucose, 0.5% (w/v) Brij 58, and 0.02 Ci [?H- 
9,10]-oleic acid (Sigma, specific activity 30.0-50.5 pCi mmol '). At 25min, 
samples were removed and filtered through 0.45-11m membrane filters 
(Metricel GN-6, Pall Life Sciences). Filters were washed with EB1 buffer contain- 
ing 0.5% Brij 58, and the radioactivity retained on the filters was counted. Counts 
for the cells expressing the FadL protein were corrected for the background 
radioactivity associated with cells containing an empty pBAD22 plasmid. To 
determine the amount of FadL protein expressed in the outer membrane, cells 
were incubated with BugBuster (Pierce) while shaking for 30 min at 37 °C, and 
centrifuged for 10 min at 20,000g. Proteins were detected by western immuno- 
blotting using the Penta-His-horseradish peroxidase conjugate antibody 
(Qiagen) and the ECL chemiluminescence detection kit (GE Healthcare). 
Proteins were quantified against FadL standards using the ImageJ program’. 
The transport activities of the cells expressing mutant FadL proteins were cor- 
rected for FadL expression levels, and the rates were expressed as a percentage of 
wild type. 
Purification, crystallization and structure determination of FadL mutants. 
For crystallization, the FadL mutant proteins were expressed from the pBAD22 
plasmids in E. coli C43 (DE3) cells** grown in 2X YT medium by induction with 
0.2% (w/v) arabinose for 6 h at 30 °C. Cells were collected, and the proteins were 
purified from the total membrane fraction as previously described for wild-type 
FadL" by Ni affinity chromatography in LDAO, gel filtration chromatography in 
LDAO (Anatrace), and a final gel filtration chromatography step in CgE, 
(Sigma). The purified proteins were concentrated to 5-10 mgml ' and flash- 
frozen in liquid nitrogen. 

Crystals were obtained by the hanging-drop method at 22 °C using commer- 
cially available screens (The Classics and MB Class I, Qiagen) or in-house screens. 
The mutant FadL proteins crystallized under the following conditions: N33A, 
0.1M cadmium chloride, 0.1M sodium acetate (pH 4.6), 30% (v/v) PEG 400; 
P34A, 0.2 M ammonium acetate, 0.1 M sodium acetate (pH 4.6), 30% (w/v) PEG 
4000; ANPA, 0.05 M magnesium acetate, 0.05 M cacodylate (pH 5.5), 35% (w/v) 
PEG 2000; G212E, 0.2 M zinc acetate, 0.1 M cacodylate (pH 6.5), 18% (w/v) PEG 
8000; AS3 kink, 0.2 M ammonium sulphate, 0.1 M MES (pH 6.5), 30% (w/v) PEG 
5000 MME; and A77E/S100R, 0.1 M sodium chloride, 0.1 M citrate (pH 5.6), 16% 
PEG 4000. The crystals were flash frozen (100 K) in the reservoir solution contain- 
ing CgE, and 15-25% (v/v) glycerol by plunging in liquid nitrogen. 

Diffraction data were obtained on beamline X6A at the National Synchrotron 
Light Source (Brookhaven National Laboratory). Data sets were integrated and 
scaled using HKL2000 (ref. 34). The structures of the mutant proteins were 
solved by molecular replacement using Phaser”; the monoclinic FadL structure 
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(PDB accession number 1T16) without the N-terminal 40 amino acids and the 
S3 kink residues 99-108 was used as the search model. Model building was done 
using COOT”, and refinement was done with CNS”. Data collection and refine- 
ment statistics for the mutant FadL proteins are summarized in Supplementary 
Table 1. 

Cloning of PaFadL. The signal sequence cleavage site for PaFadL was predicted 
to occur between residues 20 and 21 using the SignalP program”. For expression 
and outer membrane localization of PaFadL, the mature gene (lacking the endo- 
genous signal sequences and the first two residues, and additionally modified 
with a C-terminal hexahistidine tag) was amplified by PCR from P. aeruginosa 
PAOI (ATCC 47085) genomic DNA. The mature gene fragment was cloned in- 
frame with the E. coli FadL signal sequence (plus the first two residues of the 
mature FadL sequence) into the pBAD22 vector”. 

Purification, crystallization and structure determination of PaFadL. PaFadL 
was expressed in E. coliC43 (DE3) cells** and purified using the detergents LDAO 
and CgE, as described for E. coli FadL’. Crystallization trials of PaFadL were set 
up using the hanging-drop technique with commercially available crystallization 
screens (The Classics and MB Class II screens, Qiagen). Crystals for the PaFadL 
were obtained at 22°C in 0.1M lithium sulphate, 0.1 M MES buffer (pH 6.5), 
and 28-32% (w/v) PEG 400. The crystals typically appeared after 10 days and 
grew to their full size (~15 X 30 X 100 ttm) in 3-4 weeks. They belonged to space 
group C222), diffracted to a resolution of 2.1 A, and contained one molecule in 
the asymmetric unit (Matthews coefficient Vy ~ 4.2 A° Da _!, corresponding to 
~55% (v/v) solvent content”). The crystals were flash frozen (100K) directly 
from the drop by plunging in liquid nitrogen. 

A native data set was collected, as well as data sets for crystals that were soaked 
for ~40 min in 10mM OsCl; or KAuCly. The data sets were processed using 
HKL2000 (ref. 34), showing that the osmium and gold crystals were isomorph- 
ous with the native crystals (Supplementary Table 2). Two osmium and two gold 
sites were found by multiple isomorphous replacement in SOLVE (Z-score 10.2; 
FOM 0.39 for data between 20A and 2.5 A)’. Five additional gold and three 
osmium sites were found in isomorphous/anomalous difference maps and 
refined using SHARP“. Electron density maps obtained from SHARP were used 
for automatic model building in RESOLVE”, which resulted in ~60% of the 
model being built. Further model building was done manually in COOT’*, 
followed by refinement in CNS 1.2 (ref. 37). Data collection and refinement 
statistics for PaFadL are summarized in Supplementary Table 2. 
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hen President Barack Obama lifted the ban on US federal funding 

of embryonic stem-cell research on 9 March, he did more than 

make a symbolic gesture in favour of advancing science; and he 

did more than take a step towards long-touted Cif not guaranteed) 
disease cures. His statement had immediate economic repercussions. In the near 
term, there is little chance that freeing up the research will bring cures — but there 
is reason to believe the move will bring near-term jobs and investment. 

Obama said the administration of George W. Bush had forced a “false choice 
between sound science and moral values”. Science and morality are not 
inconsistent in this case, he argued, saying that embryonic stem-cell research is 
part of ahumane effort to relieve human suffering. The president went on to imply 
that lifting the ban is part of a larger plan to give scientists the support they deserve. 
Scientists should be shielded from unnecessary political influence so that they can 
do their jobs “free from manipulation or coercion”. 

The decision, however, is also about winning the war for top scientific talent. 
California's efforts to do so are well known: its US$3-billion stem-cell initiative, 
although struggling of late because of the state’s budget woes, has attracted some 
accomplished biologists. Other states have tried to follow suit, although they 
may now scale back their own funding programmes to make way for government 
funding (see Nature 458, 130-131; 2009). Countries from South Korea to the 
United Kingdom to Australia have pursued not only the research results, but the 


researchers as well. 


Obama did note the upshot of scant government support. “Promising avenues go 
unexplored,” he said. “Some of our best scientists leave for other countries that will 
sponsor their work. And those countries may surge ahead of ours in the advances 
that transform our lives.” Obama's announcement intensifies the race for cures 
and reaffirms support for a field. It also raises the stakes in the race to find the best 
stem-cell biologists on the planet. 
Gene Russo is editor of Naturejobs. 
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Personnel <3, 
dilemma <= 


The hiring of US science faculty 
members has slowed considerably. 
Karen Kaplan tracks the 
administrators who are trying to 
keep programmes intact. 


he stain of the international 
economic downturn has 
spread worldwide, but 
science institutions in the 
United States seem to be feeling its 
effects more severely than those in 
other nations. And it is not clear 
whether the science portion of the 
$787-billion federal economic 
stimulus package — $10.4 
billion of which is slated for the 
National Institutes of Health 
and $3 billion for the National 
Science Foundation — will 
help schools mitigate 
their recruitment woes 
(see Nature doi:10.1038/ 
news.2009.126; 2009). 
For now, recruitment efforts 
are rapidly decelerating virtually 
across the board, as institutions 
struggle to expand — or, in some 
cases, simply maintain — their 
science faculties (see ‘Business as 
usual’ and “House-bound’). It has 
prompted some university chancellors 
and provosts to nervously wonder just 


how long the recession’s effects will last and whether Clemson University’s cuts have already affected FA 
their universities’ core mission may start to decay. the university's science programmes, including 8 
State universities have absorbed the hardest blow. reductions in the number of research animals forits 3 
Stony Brook University in New York had a 10% across- agricultural programme. Teaching duties could be 2 
the-board budget cut. “As a result, we've had to restrict another casualty. Mark Schlissel, dean of biological g 
hiring dramatically,’ says provost Eric Kaler. “In a good sciences at the University of California, Berkeley, says ~ 
year, we might recruit and hire a dozen tenure-track he hopes to keep the teaching programme intact while 
faculty in the sciences. This year, we will probably waiting for the economy to improve. The university has 
do one?’ Consequences could range from increased significantly curtailed its recruitment efforts. 2 
teaching loads for faculty members to fewer lab 3 
sections and specialized graduate courses. “It means,’ Investing in people e 


Where endowments are more robust, and hence 
interest earned on investments still healthy, outlooks 
are brighter. Tom Skalak, vice-president for research 
at the University of Virginia in Charlottesville, is 
sanguine about a faculty recruitment slowdown of 
10-20% this year. The university has other sources of 
revenue besides the state, including philanthropic and 
corporate contributions, as well as an endowment that 
has fared relatively well. “There’s been a slowdown but 
were able to hire,’ Skalak says. 


Kaler says, “that maintaining our level of excellence is 
more difficult.” 

The effects have been felt nationwide. Stanford 
University in Palo Alto, California, has cancelled or 
delayed recruitment at its various schools, a university 
spokeswoman says. Similarly, Johns Hopkins 
University in Baltimore, Maryland, in February 
imposed a university-wide hiring freeze (except at the 
university's Applied Physics Laboratory), effective until 
June 2010. Clemson University in South Carolina also 


imposed a university-wide hiring freeze in October University of Virginia “We're hiring ten new science faculty this year, 
last year. vice-president Tom Skalak. and that’s firm,” says David Dolling, dean of the 
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SPECIAL REPORT 


Inthe United States, 
university recruitment 
is limping along, largely 
as a result of recession- 
induced shrinking 
endowments and 
dwindling state coffers. 
But some other countries’ 
science programmes 
are not yet suffering the 
same fate. “So far, it’s 
business as usual,” says 
a spokesperson for the 
University of Oxford, UK. 
The University of Exeter, 
also in Britain, is recruiting 
29 science faculty 
members, according 
to Janice Kay, a deputy 
vice-chancellor. Targeted 
disciplines include climate 
change, extrasolar planets, 
functional materials, 
systems biology and 
translational medicine. 
“We are recognizing a very 
severe recession,” she 
says, “but we're not going 
to be diverted from our 
mission of expanding first- 
class research at Exeter.” 
Germany's University of 


BUSINESS AS USUAL 


University of Exeter deputy vice-chancellor Janice Kay 
(left) and University of Toronto vice-provost Edith Hillan. 


an agreement with the state 
of Schleswig-Holstein that 
guarantees it will receive 
five years of funding, some 
of which is devoted to 
recruitment, according to 
Susanne Schuck-Zoller, 
head of the university's 
communications. And the 
downturn has not affected 
scientist recruitment at 

the country’s Max Planck 
institutes. “Our budget 

has not been pruned,” says 
Felicitas von Aretin, head of 


Kiel is insulated by virtue of — science communication. 


Nor has science-faculty 
recruitment faltered at 
the University of Toronto 
in Canada, although the 
university isn't hiring 
recklessly. “Divisions 
are being a bit more 
cautious,” says Edith 
Hillan, vice-provost for 
academic affairs, noting 
that tuition revenues and 
federal funding continue 
to be strong. “If there's no 
absolute need to hire, then 
they're making the decision 
to wait another year.” K.K. 


school of engineering and applied science at George 
Washington University in Washington DC. Dolling 
says a conservative investment strategy has protected 
the university's endowment from complete collapse. 
“Our institutional prerogative is that we are growing 
engineering and sciences.” 

Peter Lepage, dean of the college of arts and sciences 
at Cornell University in Ithaca, New York, says his 
university is relatively unscathed, at least for now, 
thanks largely to an aggressive recruiting and hiring 
strategy in recent years and other funding sources 
besides state monies. “The size of my faculty has grown 
by 6% or 7%,’ he says. “So even if we rolled it back, we 
would be setting ourselves back a decade, not to the 
nineteenth century.’ 

Like several other institutions that are expanding 
their faculties, Northeastern University in Boston, 
Massachusetts, is working to leverage its new hires by 
making sure they can operate across disciplines. “We 


Cornell University dean 
Peter Lepage. 


might have biologists working with sociologists,” says 
provost Stephen Director, referring to the university's 
multidisciplinary ‘strategic initiative, which helps 
spread out costs as well as strengthen the university's 
teaching programme. “We're trying,” Director says, “to 
build our core strengths.” Northeastern University has 
46 faculty positions to fill, about half of which are in 
basic science. 


Adapt or die 

At Pennsylvania State University in State College, 
hiring for physical-science faculty members has an 
emphasis on inter-disciplinarity, according to William 
Easterling, dean of the college of Earth and mineral 
sciences. “As long as we're willing to hire physical 
scientists who are adept at working across disciplines, 
then we're able to continue with our hiring,” he says. 

Some universities actually see the slowdown as an 
opportunity to collect top talent. Despite a serious 
endowment dip, Harvard University in Cambridge, 
Massachusetts, is seeking more than 50 new faculty, 
including science posts, said president Drew Faust 
in an announcement to alumni. Faust said that 
recruitment has slowed after a number of years of 
faculty growth, but that the university will continue to 
“plan for intensified efforts in select areas of academic 
priority”. 

Although Tufts University in Boston, Massachusetts, 
has put some faculty searches on hold, it is actively 
recruiting 52 faculty members, some of whom are in 
the sciences, according to spokeswoman Kim Thurler. 
“We see opportunities to attract outstanding faculty,” 
she says. 

A prolonged downturn could diminish any sense of 
optimism, however. “One year is not enough to hurt,’ 
says Steve Halperin, dean of the college of computer, 
mathematical and physical sciences at the University 
of Maryland in College Park. “But over a period of 
multiple years, if there's a significant [faculty] decrease, 
then I’m looking at a future with fewer faculty, and 
that will change a lot of how we do business.” Halperin 
is expecting to hire two or three faculty members this 
year, down from a high of seven or eight during a good 
year. 

Some predict even more dire consequences. An 
extended downturn could affect “new thinking, new 
ideas, new life in the nation’s graduate programmes’, 
says Robert Sowell, vice-president for programmes 
and operations at the Council of Graduate Schools in 
Washington DC. “The United States’ position relative 
to the rest of the world could start slipping,” he says. “I 
dont think this is a situation we can't recover from, but 
the real concern is our global competitiveness.” a 
Karen Kaplan is assistant editor of Naturejobs. 


HOUSE-BOUND 


Some US universities report that 
recruitment has been hampered 
by more than budget woes. The 
wider implications of the recession 
have hit potential recruits and 
affected their ability to accept 
positions. William Easterling, dean 
of the college of Earth and mineral 
sciences at Pennsylvania State 
University in State College, says 


some candidates, for example, 
have declined offers because they 
can't sell their homes. “Faculty 

at all levels, from junior to senior, 
are finding it very, very difficult to 
accept our offers because they 
cannot divest themselves from 
their current living arrangements,” 
Easterling says. “They can't sell 
their houses — especially if they're 


from an area hard hit by the sub- 
prime mortgage market.” 

The downturn threatens to 
exacerbate the notorious ‘two- 
body’ problem — handling a 
partner's job needs as well as the 
recruit's. Spouses of recruits are 
baulking at leaving their current 
position if the recruiting university 
can't offer them a job. Recently, 


Easterling says, Pennsylvania State 
University extended an offer to a 
petroleum engineer at a Canadian 
university. The engineer seemed 
receptive. But he soon revealed 
that he could not sell his home, and 
that his wife, a pharmacist, was 
concerned about finding a new 

job in State College. “He turned us 
down," says Easterling. K.K. 
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MOVERS 


Louise Ryan, chief, mathematical and information 
sciences, Commonwealth Scientific and Industrial 
Research Organisation, Sydney, Australia 


2006-09: Chair, Department 
of Biostatistics, Harvard 
School of Public Health, 
Boston, Massachusetts 
1985-2009: Assistant to full 
professor, Department of 
Biostatistics, Harvard School 
of Public Health, Boston, 
Massachusetts 


Louise Ryan's humanitarian spirit resides in the mind of a 
mathematician. After recognizing her affinity for solving 
real-world problems, Ryan carved a path as a biostatistician 
assessing environmental risks to human health. As she 
returns to her native Australia to head the maths and 
information-sciences division for the Commonwealth 
Scientific and Industrial Research Organisation, she plans 
to create opportunities for like-minded students. 

As an undergraduate, Ryan began actuarial studies at 
Macquarie University in Sydney, Australia. She lost interest 
in the financial focus at the same time that she became 
captivated by statistics. “The real world is chaotic, and! 
loved coming up with simple formulae able to reveal the 
complexity of randomness,” she says. 

Ryan's undergraduate adviser, Don McNeil, encouraged 
her to pursue a PhD in the United States. At Harvard 
University, Ryan developed diagnostic techniques to 
determine which statistical models are most appropriate 
for specific data sets, but this focus was too theoretical to 
sustain her interest. Through McNeil, she met members of 
Harvard's biostatistics department and began a postdoc 
with Stephen Lagakos, who was working to quantify cancer 
risks of red dye 40, a food-colouring agent. At the same 
time, she worked on statistical underpinnings of clinical- 
trial design at the nearby Dana-Farber Cancer Institute. 

Ryan's reputation for conducting quality environmental 
risk-assessments got her a seat on the National Academy 
of Sciences (NAS) panel on arsenic in drinking water, where 
she used her statistical expertise to quantify the effects 
associated with models’ uncertainty. She used similar 
techniques to help a NAS panel advise on methylmercury. 
“Louise cares deeply about making the world a better 
place,” says Lagakos. 

While at Harvard, Ryan headed student programmes, 
including one that gave minority students opportunities in 
biostatistics. After directing an environmental statistics 
programme, she went on to chair the biostatistics 
department. She remains an adjunct professor at Harvard. 

Her main goal now is stimulating top-notch statistics 
research opportunities in Australia. Already, she has 
started a graduate fellows programme to attract talented 
undergraduates who want work experience on the way toa 
PhD — a missing link in Australia’s maths career pipeline. 
“| want to make this division a magnet for the best talent,” 
Ryan says. | | 
Virginia Gewin 
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A bill to care for carers 


Many scientists, especially women, 
are familiar with the disruption to 
their career and funding that can 
occur as a result of extended family 
leave. To address this and the 
discrimination that sometimes 
results, Congresswoman Eddie 
Bernice Johnson (Democrat, Texas) 
has introduced a bill seeking grant 
extensions for federally funded 
scientists who take time off to care for 
family members. Funding agencies 
would also provide money to hire 
temporary workers. 

Johnson based the bill on 
recommendations in the 2007 
National Academies report Beyond 
Bias and Barriers. This is her second 
attempt; last year’s bill died in 
committee. With the new US 
administration, she says, “I think we 
might be trying at a better time now”. 

The bill calls for workshops to 
educate university leaders, grant 
reviewers and other federally funded 
researchers about how to reduce 
gender bias. Some universities have 
baulked at the suggested workshops’ 
time-consuming data-collection 
requirements, which include 
monitoring the careers of female 
faculty members from workshop 
participants’ departments. 

Many institutions have not 
compiled such data before, says Joan 


Herbers, a biologist at Ohio State 
University in Columbus ona National 
Science Foundation grant aimed at 
boosting participation of women in 
science and engineering. 

Some agencies, including the 
National Institutes of Health, already 
extend grants while investigators 
are caring for family. Provisions for 
temporary workers, however, would 
be new, says Mary Ann Mason, 
co-director of the Center on Health, 
Economic and Family Security at the 
University of California, Berkeley. 
Although researchers on multi-year 
grants cannot be easily replaced, 
Mason says hiring a temporary 
worker may help in some cases, and 
could limit concerns about hiring 
women of child-bearing age. “It's not 
an easy fix by any means,” she says. 
“But it's anecessary policy.” 

The bill's provisions are not limited 
to women. Men are increasingly 
requesting paternity leave, notes 
Phoebe Leboy, president of the 
Association for Women in Science. 
“The best thing that can happen for 
women in science,” she says, “is for 
men in science to have similar 
responsibilities and a better 
understanding of what the problems 
are.” Johnson hopes for action this 
congressional session. | 
Heidi Ledford 
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As simple as possible 


Sitting in an empty lab, surrounded by a dark campus, | think to myself: “Why 
do! want to bea scientist?” | recall a story my father told me. Unclear about the 
practical application of a difficult measurement, someone asked physiologist 
A.V. Hill: why did you do the experiment? Hill paused, then said: “Because it 
amused me." | want to be a scientist to satisfy my curiosity about the world. 

Staring at my first assistant-professor application, | think of what amuses me 
and satisfies my curiosity. | started my current project with a complex conceptual 
model for muscle activation. Recalling Henry David Thoreau’s advice to 
“simplify, simplify”, | stripped away the complexity with careful assumptions. 
Then | found an analytical expression whose taut curve hugged the numerically 
generated points of the complex model. This process of making a problem, in 
Einstein's words, “as simple as possible, but not simpler” is what amuses me. 

| send my application into cyberspace and wonder about my prospects. 
Late last year, theoretical and applied mechanics at Cornell, the department 
where | received my PhD, ceased to exist. The dean merged it into the more 
experiment-driven (and more fundable) mechanical engineering department. 
Is there a place for someone whose passion lies more on the theoretical side? 
Must | ignore what amuses me in favour of what makes me marketable? | 
Sam Walcott is a postdoc in theoretical biophysics at Johns Hopkins University in 
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Takeaway 


A word to the wise. 


Tony Ballantyne 


Stephanie stretched the phone cord to its 
full extent and leant around the doorway, 
gazing into the tiny kitchen beyond. 

“There’s a customer on the phone want- 
ing two sweet-and-sour porks delivered” 

Mr Ho shook fried rice from his wok, 
expertly filling a silver container. 

“Why you telling me that, Stephanie? 
Can't you see I’m busy?” 

“I know, Mr Ho, but the customer 
wants the order delivered to the anthill on 
Stonebreak Lane.” 

Mr Ho frowned. “Ah, that is unusual. 
Ants usually prefer lemon chicken. Tell 
them ten minutes.” 

Stephanie looked at Mr Ho. It wasn't like 
him to make jokes. It wasn't like him to do 
anything except to cookand to chastise her 
for not working hard enough. 

“Are you winding me up, Mr Ho?” 

“No, Stephanie.” 

Stephanie held his gaze for a moment, 
then she shrugged. 

“Ten minutes,” she said into the phone, 
and put it down. Mr Ho began to lecture 
her. 

“Your trouble, Stephanie, is you lazy. 
Think too much about boys and not 
enough about work. Tell me, you learn 
speak French at school?” 

Stephanie came into the kitchen and 
leant on the counter as Mr Ho rinsed the 
black wok under the tap. Steam hissed up. 

“Learn French?” said Stephanie. “Why 
should I when Ican run Transl-8 or Lebab 
or Syntactix? Who learns languages 
nowadays?” 

The wok was already back on the 
flame. 

“Wise people. Look at me. I come to 
England, I learn to speak the language 
properly. I don’t use computer or headset 
every day. I want to be treated as English- 
man, not foreigner.” 

“I don't see what that’s got to do with the 
ants.” 

“How you think they speak English?” 
Mr Ho dropped a handful of onions and 
pork into the wok. “You think ants learn 
for themselves?” 

“No. But I didn’t think ants could use 
computers.” 

“Of course they can't. But software is con- 
text sensitive. If you learned languages, you 
would understand. Words aren't enough, 
need context too. Like when you say to me 
‘nice shirt, Mr Ho, the other day. You being 
sarcastic girl, think I not realize” 


“No I wasnt...” 

“You think I not clever because I 
speak with accent. But translating 
software know about accent and 
everything else. Context is 
everything. And context get 
bigger and bigger. Not just tone 
of voice, but facial expression, set of 
body, the whole environment. What 
called the frame. And so translating 


software need to do more than just @i¢ 


listen to voice, but also to read body 
language and look at whole environ- 
ment. Ants part of environment. Small 
wonder software start to translate move- 
ments straight into phone message.’ 

“But why are ants ordering Chinese 
food?” 

“Because my food very tasty. I good 
cook. Very popular takeaway. You think 
they want to eat Mr Mahmood’s horrible 
curry? Rancid ghee and chilli powder.” 

“No! You know that’s not what I mean! 
Why aren't they out hunting for leaves or 
whatever it is ants eat?” 

“Why dont you go out and hunt or farm 
for food? Takeaway is easier. And tastier. 
And more efficient for ants. Nest is thriv- 
ing. Is now 100 yards across!” 

“What? How come I didn't know about 
that?” 

“You lazy girl. Only think about boys. 
Ants nest on television, you not see?” 

“No! But ... where do they get the money 
from?” 

“Ant nest is near computer assembly 
factory. Ants ideal for manipulating small 
parts. Cheaper than machinery too. Ants 
don't ask much money. Only get enough to 
buy tasty Chinese food” 

“That's ridiculous!” 

“No it’s not. Is animals adapting to new 
ecosystem. You not listen to biology either? 
Too busy speaking to boyfriend.” 

Mr Ho dropped a handful of pineapple 
and pepper into the wok. He shook it ina 
hissing of steam and rattle of frying. 

Stephanie bit her lip. “Are all the animals 
ordering takeaways, then?” 

“No. Most animals lazy. Won't move 
with times. Like relatives who stay at home 
and not grasp opportunities of Western 
world” 

Mr Ho picked up a ladle and, tossing the 
contents of the pan all the while, spooned 
three lots of red sauce over the ingredi- 
ents. 

“You're winding me up, right?” said 
Stephanie. “You're seriously telling me 
that because translation software looks 
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at the whole context of language, it has 
enabled communication between spe- 
cies, the results of which are that whole 
ecosystems are changing due to animals 
exploiting new opportunities. The result 
of this is that an ant’s nest is now order- 
ing takeaway food bought using money 
it earned by working in a microprocessor 
plant, this being a more viable means of 
existence?” 

“Elegantly put. See? You clever girl when 
you want to be.” 

“So why does that mean I should learn 
French?” 

“Ants not know language properly, not 
know wider context, so just respond to 
need. Get exploited. All that work in com- 
puter plant, and only get paid enough to 
buy takeaway.” 

The sweet and sour pork was ready. Mr 
Ho expertly divided the contents of the 
wok between two silver trays. 

“All done,’ he said. “Feed hungry ants. 
Very good customers. Ah! Here come 
delivery girl!” 

“Delivery girl?” said Stephanie. “What 
happened to Adam?” 

“Too expensive. And lazy. Prefer spend 
time chatting to you than delivering tasty 
food. Delilah more reliable.” 

Stephanie looked down at Delilah as she 
trotted into the little kitchen, tail wagging. 
She was very good, eyes to the front, not 
stopping to sniff anything. Mr Ho bagged 
the order, putting the handle into her 
mouth. 

“Good girl,” he said as the dog trotted 
out of the shop. “Work very hard” 


Stephanie watched her go. 
The next day, she signed up for French 
lessons. a 


Two of Tony Ballantyne’s most recent 
stories appear in The Year's Best SF 13, 
edited by David Hartwell and Kathryn 
Cramer. His next novel, Twisted Metal, is 
published in May by Macmillan. 
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